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NUCLEIC ACID MOLECULES ENCODING A TRANSMEMBRANE SERINE 
PROTEASE 20, THE ENCODED POLYPEPTIDES AND METHODS BASED 

THEREON 

RELATED APPLICATIONS 

5 Benefit of priority is claimed to U.S. provisional application Serial No. 

60/302,939, filed July 3, 2001, to Edwin L. Madison and Edgar O. Ong, entitled 
"NUCLEIC ACID MOLECULES ENCODING TRANSMEMBRANE SERINE 
PROTEASE 20, THE ENCODED PROTEINS AND METHODS BASED THEREON." 
Where permitted, subject matter of this application is incorporated in its entirety 

10 by reference thereto. 
FIELD OF INVENTION 

Nucleic acid molecules that encode proteases and portions thereof, 
particularly protease domains are provided. Also provided are prognostic, 
diagnostic and therapeutic methods using the proteases and domains thereof and 

15 the encoding nucleic acid molecules. 

BACKGROUND OF THE INVENTION AND OBJECTS THEREOF 

Cancer, which is a leading cause of death in the United States, is 
characterized by an increase in the number of abnormal neoplastic cells, which 
proliferate to form a tumor mass, the invasion of adjacent tissues by these 

20 neoplastic tumor cells, and the generation of malignant cells that metastasize via 
the blood or lymphatic system to regional lymph nodes and to distant sites. 
Among the hallmarks of cancer is a breakdown in the communication among 
tumor cells and their environment. Normal cells do not divide In the absence of 
stimulatory signals and cease dividing in the presence of inhibitory signals. 

25 Growth-stimulatory and growth-inhibitory signals, are routinely exchanged 
between cells within a tissue. In a cancerous, or neoplastic, state, a cell 
acquires the ability to "override" these signals and to proliferate under conditions 
in which normal cells do not grow. 

In order to proliferate tumor cells acquire a number of distinct aberrant 

30 traits reflecting genetic alterations. The genomes of certain well-studied tumors 
carry several different independently altered genes, including activated 
oncogenes and inactivated tumor suppressor genes. Each of these genetic 
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changes appears to be responsible for innpartjng some of the traits that, in the 
aggregate, represent the full neoplastic phenotype. 

A variety of biochemical factors have been associated with different 
phases of metastasis. Cell surface receptors for collagen, glycoproteins such as 
5 laminin, and proteoglycans, facilitate tumor cell attachment, an important step in 
invasion and metastases. Attachment triggers the release of degradative 
enzymes which facilitate the penetration of tumor cells through tissue barriers. 
Once the tumor cells have entered the target tissue, specific growth factors are 
required for further proliferation. Tumor invasion and progression involve a 

10 complex series of events, In which tumor cells detach from the primary tumor, 
break down the normal tissue surrounding it, and migrate Into a blood or 
lymphatic vessel to be carried to a distant site. The breaking down of normal 
tissue barriers is accomplished by the elaboration of specific enzymes that 
degrade the proteins of the extracellular matrix that make up basement 

1 5 membranes and stromal components of tissues. 

A class of extracellular matrix degrading enzymes has been implicated in 
tumor invasion. Among these are the matrix metalloproteinases {MMP). For 
example, the production of the matrix metalloproteinase stromelysin is 
associated with malignant tumors with metastatic potential (see, e.g., McDonnell 

20 eta/. (1990) Smnrs. in Cancer Biology 7:107-115; McDonnell et al. (1990) 
Cancer and l^etastasis Reviews 5:309-319). 

The capacity of cancer cells to metastasize and invade tissue is facilitated 
by degradation of the basement membrane. Several proteinase enzymes, 
including the MMPs, have been reported to facilitate the process of invasion of 

25 tumor cells. MMPs are reported to enhance degradation of the basement 
membrane, which thereby permits tumorous cells to invade tissues. For 
example, two major metalloproteinases having molecular weights of about 70 
kDa and 92 kOa appear to enhance ability of tumor cells to metastasize. 
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Type II Transmembrane Serine Proteases 

In addition to the MMPs, serine proteases have been implicated in 
neoplastic disease progression. Most serine proteases, which are either 
secreted enzymes or are sequestered in cytoplasmic storage organelles, have 
5 roles in blood coagulation, wound healing, digestion/ immune responses and 
tumor invasion and metastasis. A class of cell surface proteins designated type 
II transmembrane serine proteases, which are membrane-anchored proteins that 
often include additional extracellular domains, has been identified. As cell 
surface proteins, they are positioned to ptay a role in intracellular signal 

10 transduction and in mediating cell surface proteolytic events. 

Cell surface proteolysis is a mechanism for the generation of biologically 
active proteins that mediate a variety of cellular functions. Membrane- 
associated proteases include membrane-type metalloproteinases (MT-MMP), 
ADAMs (proteases that contain disintegrin-like and metalloproteinase domains) 

1 5 and the transmembrane serine proteases. In mammals, at least 1 7 members of 
the transmembrane serine protease family are known, including seven in humans 
(see, Hooper eta/. (2001) J. Bio/, Chtem. 276:857-860). These include: corin 
(accession nos. AF1 33845 and AB013874; see, Yan et a/. (1999) J. BioF. Chem. 
274:14926-14938; Tomia eta/. (1998) J. Biochem. ;24:784-789; Uan eta/. 

20 (2000) Proc. Nat/. Acad. Sci. U.S.A. 57:8525-8529); enteropeptidase (also 
designated enterokinase; accession no. U09860forthe human protein; see, 
Kitamoto eta/. (1995) Biochem. 27: 4562-4568; Yahagi eta/. (1996) Biochem. 
Biophys. Res. Commun. 275:806-812; Kitamoto et ai. (1994) Proc. Nat/. Acad, 
Sci. U.S.A. 57:7588-7592; Matsushima et ai. (1994) J. Bio/. Chem. 265:19976- 

25 19982;); human airway trypsin-like protease (HAT; accession no. AB002134; 
see Yamaoka era/. J. Bio/. Chem. 273:11894-11901); MTSP1 and matriptase 
(also called TADG-15; see SEQ ID Nos. 1 and 2; accession nos. 
AF133086/AF1 18224, AF04280022; Takeuchi eta/. (1999) Proc. Nat/. Acad. 
Sci. U.S.A. 56:11054-1161; Lin era/. (1999) J. Bio/. Chem. 274:18231-18236; 

30 Takeuchi et ai. (2000) J. Bio/. Chem. 275:26333-26342; and Kim eta/. (1999) 
/mmunogenetics 45:420-429); hepsin (see, accession nos. Ml 8930, AF030065, 
X70900; Leytus et ai. (1988) Biochem. 27: 11895-11901; Vu eta/. (1997) J. 
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Biol. Chem. 272:31315-31320; and Farley et ai (1993) Biochem. Biophys. Acta 
; 7 75:350-352; and see, U.S. Patent No. 5,972,6161; TMPRS2 (see, Accession 
Nos- U75329 and AF113596; Paoloni-Giacobino et a/. (1997) Genomics 44:309- 
320; and Jacquinet et al, (2000) FEBS Lett. 468: 93-100); and TMPRSS4 (see, 
5 Accession No. NM 01 6425; Wallrapp et al. (2000) Cancer ffO: 2 602-2 606). 

Serine proteases, including transmembrane serine proteases and secreted 
proteases^ have been implicated in processes involved in neoplastic development 
and progression. While the precise, detailed mechanism by which these 
proteases promote tumor growth and progression has not been elaborated, 

10 serine proteases and inhibitors thereof are involved in the control of many intra- 
and extracellular physiological processes, including degradative actions in cancer 
cell invasion, metastatic spread, and neovascularization of tumors, that are 
involved in tumor progression. It is believed that proteases are Involved in the 
degradation of extracellular matrix (ECM) and contribute to tissue remodeling, 

1 5 and are necessary for cancer invasion and metastasis. The activity and/or 
expression of some proteases have been shown to correlate with tumor 
progression and development. 

For example, a membrane-type serine protease MTSP1 (also called 
matriptase; see SEQ ID Nos. 1 and 2 from U.S. Patent No. 5,972,616; and 

20 GenBank Accession No. AP1 18224; Un eta/. (1999) J. Bio/. Chem. 274:18231- 
18236; U.S. Patent No. 5,792,616; see, also Takeuchi (1999) Proc. Natl. Acad. 
Sci. U.S.A. 96:^ 1054-1 161) that is expressed in epithelial cancer and normal 
tissue (Takeucuhi eta/. (1999) Proc. Natl. Acad. Sci USA 96:^ 1054-61) has 
been identified. Matriptase was originally identified in human breast cancer cells 

25 as a major gelatinase (see, U.S. Patent No. 5,482,848) and was initially believed 
to be a type of matrix metalloprotease (MMP). It has been proposed that it plays 
a role in the metastasis of breast cancer. Matriptase also is expressed in a 
variety of epithelial tissues with high levels of activity and/or expression in the 
human gastrointestinal tract and the prostate. MTSPs, designated MTSP3, 

30 MTSP4. MTSP6 have been described in published International PCT application 
No. WO 01/57194, based in International PCT application No. 
PCT/USOl/03471. 
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Prostate-Specific antigen (PSA), a kallikrein-like serine protease, degrades 
extracellular matrix glycoproteins fibronectin and laminin, and has been 
postulated to facilitate invasion by prostate cancer cells (Webber et af. (1995) 
din. Cancer Res., ;^fO:1089-94}. Blocking PSA proteolytic activity with 
5 PSA-specific monoclonat antibodies results in a dose-dependent decrease in vitro 
in the invasion of the reconstituted basement membrane Matriget by LNCaP 
human prostate carcinoma cells which secrete high levels of PSA. 

Hepsin, a cell surface serine protease identified In hepatoma cells, is 
overexpressed in ovarian cancer (Tanimoto et aL (1997) Cancer Res., 

10 57^74^:2884-7). The hepsin transcript appears to be abundant in carcinoma 
tissue and Is almost never expressed in normal adult tissue, including normal 
ovary. It has been suggested that hepsin is frequently overexpressed in ovarian 
tumors and therefore can be a candidate protease in the invasive process and 
growth capacity of ovarian tumor cells. 

15 A serine protease-like gene, designated normal epithelial cell-specific 1 

(NES1) (Liu et al.. Cancer Res., 66(14}:337A'9 (1996)) has been identified. 
Although expression of the NES1 mRNA Is observed in all normal and 
immortalized nontumorigenic epithelial cell lines, the majority of human breast 
cancer cell lines show a drastic reduction or a complete lack of its expression. 

20 The structural similarity of NES1 to polypeptides known to regulate growth 
factor activity and a negative correlation of NES1 expression with breast 
oncogenesis suggest a direct or indirect role for this protease-like gene product 
in the suppression of tumorigenesis. 

Angioganesis, modulators and associated diseases 

25 Anglogenesis Is the generation of new blood vessels from parent 

microvessels. Controlled and uncontrolled anglogenesis proceed in a similar 
manner. Endothelial cells and pericytes, surrounded by a basement membrane, 
form capillary blood vessels. Anglogenesis begins with the erosion of the 
basement membrane by enzymes released by endothelial cells and leukocytes. 

30 The endothelial cells, which line the lumen of blood vessels, then protrude 

through the basement membrane. Angiogenic stimulants induce the endothelial 
cells to migrate through the eroded basement membrane. The migrating cells 
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form a "sprout" off the parent blood vessel, where the endothelial cells undergo 
mitosis and proliferate. The endothelial sprouts merge with each other to form 
capillary loops, creating the new blood vessel. 

Angiogenesis is highly regulated by a system of angiogenic stimulators 
5 and inhibitors. Known examples of angiogenesis stimulators include certain 
growth factors, cytokines, proteins, peptides, carbohydrates and lipids (Norrby, 
APMIS, 105 :417-437 {1997); Polverini, Crit Rev. Oral. Biol. Med., 6:230-247 
(1995)}. A variety of endogenous and exogenous angiogenesis inhibitors are 
known in the art (Jackson et al., FASEB, 11:457-465 (1997); Norrby, APMIS, 

1 0 105 :41 7-437 (1 997); and O'Reilly, Investigational New Drugs, 15:5-1 3 (1 997)1. 

In adult organisms, capillary endothelial cells divide relatively infrequently. 
When triggered by appropriate signals, e.g., in response to hormonal signals 
during menses or following the release of pro-angiogenic mediators sequestered 
in the extracellular matrix, endothelial cells lining venules will systematically 

1 5 degrade their basement membrane and proximal extracellular matrix, migrate 
directionally, divide, and organize into new functioning capillaries, within a 
matter of days (Polverini, Crit. Rev. Oral. Biol. Med., 6:230-247 (1995)). This 
dramatic amplification of the microvasculature is nevertheless temporary, for as 
rapidly as the new capillaries are formed, they virtually disappear within a matter 

20 of days or weeks, returning the tissue microvasculature to its status quo. It is 
this feature of transient growth and regression of capillaries that primarily 
distinguishes physiological angiogenesis from pathological one (Polverini, Crit. 
Rev. Oral. Biol. Med., 6:230-247 (1995)). In contrast, pathological angiogenesis 
is caused by a shift in the net balance between stimulators and inhibitors of 

25 angiogenesis, e.g., due to the overproduction of normal or aberrant forms of 

angiogenic mediators, or due to a relative deficiency in inhibitors of this process 
(Polverini, Crit. Rev. Oral. Biol. Med., 6:230-247 (1995)). 

Angiogenesis is essential for normal placental, embryonic, fetal and post- 
natal development and growth, but almost never occurs physiologically in 

30 adulthood except in very specific restricted situations. For example, 

angiogenesis is normally observed in wound healing, fetal and embryonal 
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devetopment and formation of the corpus luteum, endometrium and placenta. 
Angiogenesis in the adult is often associated with disease states. 

Persistent, unregulated angiogenesis occurs in a multiplicity of disease 
states, tumor metastasis and abnormal growth by endothelial cells artd supports 
5 the pathological damage seen in these conditions. The diverse pathological 

disease states in which unregulated angiogenesis is present have been grouped 
together as angiogenic dependent or angiogenic associated diseases. 

The control of angiogenesis is altered in certain disease states and, in 
many cases, the pathological damage associated with the disease is related to 

10 uncontrolled angiogenesis [see generaUy Norrby, APMiS, 105 :417-437 (1997); 
and O'Reilly, investigational New Drugs. 1_5:5-13 (1 997)). Thus, angiogenesis 
is involved in the manifestation or progress of various diseases, for example, 
various inflammatory diseases, such as rheumatoid arthritis, psoriasis, diabetic 
retinopathies, certain ocular disorders, including recurrence of pterygii, scarring 

15 excimer laser surgery and glaucoma filtering surgery, various disorders of the 
anterior eye, cardiovascular disorders, chronic inflammatory diseases, wound 
repair, circulatory disorders, crest syndromes, dermatological disorders {see, 
e.g., U.S. Patent Nos. 5,593,990, 5,629,327 and 5,712,291) and notably 
cancer, including solid neoplasms and vascular tumors. Several lines of direct 

20 evidence indicate that angiogenesis is essential for the growth and persistence of 
solid tumors and their metastases. 

Thus, it is clear that angiogenesis plays a major role in the metastasis of 
cancer and in the pathology of a variety of other disorders. Repressing, 
eliminating or modulating this activity, should impact the etiology of these 

25 diseases and serve as a point of therapeutic intervention. In the disease state, 
prevention of angiogenesis could avert the damage caused by the invasion of the 
new microvascular system. Therapies directed at control of the angiogenic 
processes could lead to the abrogation or mitigation of these diseases. 

Hence there is a need to develop therapeutics that target angiogenesis 

30 and modulate, particularly, inhibit aberrant or uncontrolled angiogenesis. In 
addition, transmembrane serine proteases appear to be involved in the etiology 
and pathogenesis of tumors. There is a need to further elucidate their role in 
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these processes and to identify additional transmembrane proteases. Therefore, 
it is an object herein, among others, to provide transmembrane serine protease 
(MTSP) proteins and nucleic acids encoding such MTSP proteases that are 
involved in the regulation of, or participate in, tumorlgenesis and/or 
5 carcinogenesis. It is also an object, among others, herein to provide nucleic 
acids encoding the proteins and polypeptides and also to provide proteins and 
polypeptides that are involved in the regulation of angiogenesis. Among the 
objects herein, it is also an object to provide prognostic, diagnostic and 
therapeutic screening methods using such proteases and the nucleic acids 
1 0 encoding such proteases, and to provide assays for identification of agents that 
target angiogenesis and modulate, particularly, inhibit aberrant or uncontrolled 
angiogenesis. 
SUMMARY 

Provided herein are polypeptides, including protease domains thereof, 
15 designated herein as MTSP20 polypeptides. Protease domains and full-length 
polypeptides, including the zymogen and activated forms, and uses thereof are 
also provided. Polypeptides encoded by splice variants are also provided. Also 
provided are muteins and other derivatives and analogs thereof. Also provided 
herein are nucleic acids encoding the MTSP20s. The MTSP20s provided herein 
20 are also endotheliases, which are a class membrane proteases that are 

expressed on cells, particularly endothelial cells that participate in angiogenesis. 

Protease domains provided herein include, but are not limited to, the 
single chain region having an N-terminus at the cleavage site for activation of the 
zymogen, through the C-terminus, or C-terminal truncated portions thereof that 
25 exhibit proteolytic activity as a single-chain polypeptide in in vitro proteolysis 
assays of MTSP20, from a mammal, including a human, that, for example, 
displays functional activity in tumor cells that is different from its activity non- 
tumor cells. 

Nucleic acid molecules encoding the proteins and protease domains are 
30 also provided. Nucleic acid molecules that encode single-chain protease domains 
or catalytically active portions thereof and multichain forms of proteases 
domains, and also those that encode the full-length MTSP20 and portions 
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thereof are provided. In an exemplary embodiment, a nucleic acid that encodes 
an MTSP20 is provided. The nucleic acid molecule includes the sequence of 
nucleotides set forth in SEQ ID No. 5 or SEQ ID No. 15 or a portion thereof {see, 
also EXAMPLE 1) that encodes a catalytically active polypeptide or a domain 
5 thereof. Nucleic acid encoding a protease domain and upstream nucleic acid is 
set forth in SEQ ID No. 5; and a protease domain of MTSP20 is set forth in SEQ 
ID No. 6. An exemplary nucleic acid molecule encoding the full-length 
polypeptide, which includes two protease domains, is set forth in SEQ ID No. 
1 5; the encoded polypeptide is set forth in SEQ ID No. 1 6. 

10 Also provided are nucleic acid molecules that encode all or a portion of a 

catalytically active polypeptide, or a nucleic acid molecule that encodes a 
protease domain or a larger polypeptide that can include up to the full length 
polypeptide, and that hybridizes to such MTSP20-encoding nucleic acid along 
their full-length or along at least about 70%, 80% or 90% of the full-length and 

15 encode a protease domain or portion thereof are provided. Hybridization is 

generally effected under conditions of at least low, generally at least moderate, 
and often high stringency. 

The isolated nucleic acid fragment is DNA, including genomic or cDNA, or 
is RNA, or can include other components, such as peptide nucleic acid (PNA) or 

20 other nucleotide analogs. The isolated nucleic acid can include additional 

components, such as heterologous or native promoters, and other transcriptional 
and translational regulatory sequences. These genes may be linked to other 
genes, such as reporter genes or other indicator genes or genes that encode 
indicators. 

25 Also provided are isolated nucleic acid molecules that include a sequence 

of molecules that is complementary to the nucleic acid molecule encoding an 
MTSP20 or the portion thereof. 

Also provided are nucleic acid molecules that hybridize under conditions 
of at least low stringency, generally moderate stringency, more typically high 

30 stringency to the sequence of nucleotides set forth in SEQ ID No. 5 or SEQ ID 
No. 15 or degenerates thereof. In one embodiment, the isolated nucleic acid 
fragment hybridizes to a nucleic acid molecule containing the nucleotide 
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sequence set forth in SEQ ID No. 5 or SEQ ID No. 15 (or degenerates thereof) 
under high stringency conditions. In one embodiment, it contains the sequence 
of nucleotides set forth in SEQ ID No. 5. A full-length MTSP20 polypeptide 
includes the sequence of amino acids set forth in SEQ ID No. 6 or SEQ ID No. 
5 16, and is encoded by a sequence of nucleotides set forth in SEQ ID No. 5 or 
SEQ ID No. 1 5 or degenerates thereof. Methods for isolating nucleic acid 
encoding other MTSP20s, including nucleic acid molecules encoding full-length 
molecules and splice variants and MTSPs from species, such as cows, sheep, 
goats, pigs, horses, primates, including chimpanzees and gorillas, rodents, dogs, 

10 cats and other species of interest, such as domesticated animals, farm and zoo 
animals are also provided. The nucleic acid molecules provided herein, including 
those set forth in SEQ ID Nos. 5 and 15 can be used to obtain nucleic acid 
molecules encoding full-length MTSP20 polypeptides from human sources or 
from other species, such as by screening or PCR amplification, or other such 

1 5 methods, appropriate libraries using the nucleic acid molecules or selected 
primers or probes based thereon. 

Also provided are fragments thereof or oligonucleotides that can be used 
as probes or primers and that contain at least about 10, 14, 16 nucleotides, 
generally less than 1000 or less than or equal to 100, set forth in SEQ ID No. 5 

20 or SEQ ID No. 15 (or the complement thereof); or contain at least about 30 
nucleotides (or the complement thereof) or contain oligonucleotides that 
hybridize along their full-length (or at least about 70, 80 or 90% thereof) to any 
such fragments or oligonucleotides. The length of the fragments are a function 
of the purpose for which they are used and/or the complexity of the genome of 

25 interest. Generally probes and primers contain less than about 30, 50, 150 or 
500 nucleotides. 

Also provided are plasmids containing any of the nucleic acid molecules 
provided herein. Cells containing the plasmids are also provided. Such cells 
include, but are not limited to, bacterial celts, yeast cells, fungal cells, plant ceils, 
30 insect celts and animal cells. 

Methods of expressing the encoded MTSP20 polypeptide and portions 
thereof using the cells are also provided, as are cells that express MTSP20 on 
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the cell surface. Such cells are used in methods of identifying candidate 
therapeutic compounds. 

MTSP20, particularly protease domains thereof, can be produced by 
growing the above-described cells under conditions whereby the MTSP20 is 
5 expressed by the cells, and recovering the expressed MTSP20 polypeptide. 

Also provided are cells, generally eukaryotlc cells, such as mammalian 
cells and yeast cells, in which the MTSP20 polypeptide is expressed on the 
surface of the cells. Such cells are used in drug screening assays to identify 
compounds that modulate an activity of the MTSP20 polypeptide. These 

10 assays, including in vitro binding assays, and transcription based assays in 

which signal transduction is mediated directly or indirectly, such as via activation 
of pro-growth factors, by the MTSP20 is assessed. 

Also provided are peptides that are encoded by such nucleic acid 
molecules. Included among those polypeptides are an MTSP20 protease domain 

15 or a polypeptide with amino acid changes such that the specificity and/or 

protease activity remains substantially unchanged. In particular, a substantially 
purified mammalian MTSP20 polypeptide is provided that includes a serine 
protease catalytic domain and may additionally include other domains. The 
MTSP20 can form homodimers and can also form heterodimers with some other 

20 protein, such as a membrane-bound protein. Also provided is a substantially 
purified protein including a sequence of amino acids that has at least 60%, 70%, 
80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% identity to the MTSP20 where the 
percentage identity is determined using standard algorithms and gap penalties 

25 that maximize the percentage identity. A human MTSP20 polypeptide is 

exemplified, although other mammalian MTSP20 polypeptides are contemplated. 
Splice variants of the MTSP20, particularly those with a proteolytically active 
protease domain, are contemplated herein. 

In other embodiments, substantially purified polypeptides that include a 

30 protease domain of a MTSP20 polypeptide or a catalytically active portion 

thereof are provided. Among these are polypeptides that include a sequence of 
amino acids that has at least 60%, 70%, 80%, 85%, 90%, 95% or 100% 
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sequence identity to SEQ ID No. 6 or SEQ ID No. 16 or to a portion thereof that 
includes a catalytically active polypeptide. 

Also provided are muteins of single chain protease domains of MTSP20 
particularly muteins in which the Cys residue in a protease domain that is free 
5 {i.e., does not form disulfide linkages with any other Cys residue in. a protease 
domain) is substituted with another amino acid substitution, typically, although 
not necessarily, with a conservative amino acid substitution or a substitution 
that does not eliminate an activity, and muteins in which a glycosylation site(s) 
is eliminated. 

10 Hence muteins in which one or more of the Cys residues, particularly, a residue 
that is paired in the activated two form, but unpaired in a protease domain alone 
{i.e., the Cys {Cjjg) residue position 219 and the Cys (C477) residue 477 SEQ ID 
Nos. 15 and 16 in a protease domain), is/are replaced with any amino acid, 
typically, although not necessarily, a conservative amino acid residue, such as 

15 Ser, are contemplated. Muteins of MTSP20, particularly those in which Cys 
residues, such as the unpaired Cys in the single chain protease domain, is 
replaced with another amino acid that does not eliminate an activity, are 
provided. Muteins in which other conservative or non-conservative amino acid 
substitutions in which catalytic activity is retained are also contemplated (see, 

20 e.g., Table 1, for exemplary amino acid substitutions). 

MTSP20 polypeptides, including, but not limited to splice variants 
thereof, and nucleic acids encoding MTSPs, and domains, derivatives and 
analogs thereof are provided herein. Single chain protease domains that have an 
N-terminus functionally equivalent to that generated by activation of the 

25 zymogen form of MTSP20 are also provided. The MTSP20 exemplified herein 
(see SEQ ID No. 16) has a signal peptide (aa 1 to aa 24), a transmembrane 
domain (aa 82 to aa 99) and two potential N-glycosylation sites (...N„QS... and 
...Naai'S). This MTSP20 has two trypsin-like serine protease domains in tandem, 
herein after referred to as MTSP20-PD1 (aa 1 13 to aa 343) and MTSP20-PD2 

30 (aa 375 to aa 596). Both protease domains are characterized by the presence of 
a protease activation cleavage site at the beginning of the domain and the 
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catalytic triad residues (histidine, aspartate and serine) in 3 highly-conserved 
regions of the catalytic domain. 

MTSP20-PD1 has the following features: a protease cleavage site 
(...KiijiPiiaQEGN..., where 1 indicates the cleavage site); catalytic triad 
5 residues {H^gj, D203 and S299); the following cysteine pairings within the protease 
domain {0,3,-0,53, C233-C305. 02^-028^ and Cjss-CSjJ; an unpaired cysteine (Cj,g) 
in the protease domain is predicted to pair with 0,03 so that a multichain form 
would include amino acid residues from the cleavage site (K,,2 up to and 
including the C,o3 ancf optionally back to the N-terminus in the second chain 

10 formed upon activation cleavage). 

MTSP20-PD2 has the following features: the protease cleavage site 
(...R375*T376AGPQ...|; catalytic triad residues (H^^g, D457 and 8553); the following 
cysteine pairings within the protease domain (0401-0417, Cgig-Oggg and C549-C577); 
an unpaired cysteine (O477) in the protease domain is predicted to pair with C37, 

15 so that a multichain form would include amino acid R375 up to and including the 
C371 and optionally beyond towards the N-terminus in the second chain formed 
upon activation cleavage. In addition, as noted PD1 can form two chains, so 
that the entire molecule can include at least three chains. 

As noted the protease is provided as a single- or multi-chain form, such 

20 as two- or three- or more-chain molecule. Single chain forms of each protease 
domain and each of the multi- chain forms of a protease domains up to the full 
length forms are proteolytically active. As noted above, an unpaired Cys in a 
protease domain pairs with a Oys outside the domain. Upon activation cleavage 
the bond remains resulting in two or three chains. The size of chain "A", that is 

25 the most N-terminal chain, is a function the starting length of the polypeptide 
prior to activation cleavage. Similarly, the size of the second so-called chain 
"B" chain upon activation cleavage of the second protease domain in the 
polypeptide is a function of the starting length of the polyeptide prior to 
activation cleavage. Any length polypeptide that includes a protease domain or 

30 catalytically active fragments thereof, is contemplated herein, fn particular, any 
polypeptide that includes MTSP20 PD2 is provided, including a full-length 
MTSP20 polyeptide that includes two protease domains and catatically active 
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portions there of that include amino acids 624-642 or portions thereof. Also 
provided are the Isolated protease domain PD1 as an isolated single cahin 
protease and as a two chain activated form that includes up to amino acid 343 
(SEQ ID NO. 16), particularly amino acids 1 13-343 as a single chain isolated 
5 polypeptide. Also provided are polyeptides encoded by nucleic acid molecules 
that are encoded by nucleic acid molecules that hybridize under specified 
conditions as described herein to the exemplified nucleic acid molecule and 
polypeptides that are homologous as described herein to the the exemplified 
polypeptide. 

10 MTSPs are expressed or are activated in certain tumor or cancer cells 

such as lung, prostate, colon and breast, ovarian, pancreatic, lung in other 
tumors. MTSP20 is of interest because it is expressed or is active in tumor 
cells. In particular, it is shown herein, that MTSP20 is, for example, expressed 
in esophageal tumor tissues, in lung carcinoma, colon, cervical, leukemia and 

15 other cell lines as well as in certain normal cells and tissues (see e.g., 

EXAMPLES for tissue-specific expression profile). The level of activated 
MTSP20 can be diagnostic of prostate, uterine, lung, esophagus or colon cancer, 
or leukemia or other cancer. The expression and/or activation of MTSP20 on or 
In the vicinity of a celt or in a bodily fluid in a subject can be a marker for breast, 

20 prostate, lung, colon and other cancers. 

Hence the MTSPs provided herein can serve as diagnostic markers for certain 
tumors. In certain embodiments, the MTSP20 polypeptide or portions thereof, 
particularly a protease domain thereof, is detectable in a body fluid at a level or 
in a form that differs from its level or in a form in body fluids in a subject not 

25 having a tumor. In other embodiments, the polypeptide is present in a tumor; 
and a substrate or cofactor for the polypeptide is expressed at levels that differ 
from its level of expression in a non-tumor cell in the same type of tissue. In 
other embodiments, the level of expression and/or activity of the MTSP20 
polypeptide in tumor cells differs from its level of expression' and/or activity in 

30 non-tumor cells. In other embodiments, the MTSP20 is present in a tumor; and 
a substrate or cofactor for the MTSP20 is expressed at levels that differ from its 
level of expression in a non-tumor cell in the same type of tissue. 
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Assays for identifying effectors, such as compounds, including small 
molecules, and conditions, such pH, temperature and Ionic strength, that 
modulate the activation, expression or activity of MTSP20 are also provided 
herein. In exemplary assays, the effects of test compounds on the ability of a 



5 protease domain of MTSP20 to proteolytically cleave a known substrate, ' - 
typically a fluorescently, chromogenically or otherwise detectably labeled 
substrate, are assessed. Agents, generally compounds, particularly small 
molecules, that modulate an activity of a protease domain are candidate 
compounds for modulating an activity of an MTSP20. A protease domains can 

10 also be used to produce protease-specific antibodies. 

Also provided are methods for screening for compounds that modulate an 
activity of MTSP20. The compounds are identified by contacting them with an 
MTSP20 or protease domain thereof and a substrate for the MTSP20. A change 
in the amount of substrate cleaved in the presence of the compounds compared 

15 to that in the absence of the compound indicates that the compound modulates 
* an activity of an MTSP20. Such compounds are selected for further analyses or 
for use to modulate the activity of an MTSP20, such as inhibitors or agonists. 
The compounds can also be identified by contacting the substrates with a cell 
that expresses an MTSP20 or an extracellular domain or proteolytically active 

20 portion of thereof. 

Computer-based screening methods are also provided. In these methods, 
these interactions between test compounds computer simimulated MTSP20 
polypeptides are assessed, such as by computational docking or binding studies. 
Test compounds predicted to bind or otherwise interact with MTSP20 

25 polypeptides are selected as drug candidates. Further characterization and 



study, such in vitro assays, can be performed. 

Also provided herein are methods of modulating an activity of an MTSP20 
and screening for compounds that modulate, including inhibit, antagonize, 
agonize or otherwise alter an activity of the MTSP20. Of particular interest is an 
30 extracellular domain of MTSP20 that includes a proteolytic (catalytic) portion of 
the protein. 
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Additionally provided herein are antibodies that specificallv bind to single 
and/or multi-chain fornns of MTSP20, cells, combinations, kits and articles of 
manufacture that contain the antibodies. Antibodies that specifically bind to the 
MTSP20, particularly a single-chain protease domain, a two-chain form of a 
5 protease domain, the zymogen and multi-chain activated forms of an MTSP20 
and other fragments thereof. Neutralizing antibodies that inhibit a biological 
activity, particularly a protease activity are also provided. In particular, 
antibodies that specifically bind to an MTSP20 that includes two protease 
domains, particularly that includes amino acids that correspond to amino acids 

10 624-642 of SEQ ID No. 16, with at least 10-foid, generally at least 100-fold, 
greater affinity than to an MTSP of SEQ ID No. 1 7 are provided. 

Further provided herein are prognostic, diagnostic, therapeutic screening 
methods using MTSP20 and the nucleic acids encoding MTSP20. In particular, 
the prognostic, diagnostic and therapeutic screening methods are used for 

15 preventing, treating, or for finding agents useful in preventing or treating, tumors 
or cancers such as lung carcinoma, colon adenocarcinoma and ovarian 
carcinoma. 

Also provided herein are modulators of an activity of MTSP20, especially 
the modulators obtained according to the screening methods provide herein. 
20 Such modulators can have use in treating cancerous conditions. 

Methods of diagnosing a disease or disorder characterized by detecting an 
aberrant level of an MTSP20 in a subject is provided. The method can be 
practiced by measuring the level of the DNA, RNA, protein or a functional 
activity of an MTSP20. An increase or decrease in the level of the DNA, RNA, 
25 protein or functional activity of the MTSP, relative to the level of the DNA, RNA. 
protein or functional activity found in an analogous sample not having the 
disease or disorder (or other suitable control) is indicative of the presence of the 
disease or disorder in the subject or other relative any other suitable control. 

Also provided are methods of identifying a compound that binds to the 
30 single-chain and/or multi-chains form of MTSP20. by contacting a test 

compound with these forms; determining to which form the compound binds; 
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and if it binds to a form of MTSP20, further determining whether the compound 
has at (east one of the following properties: 

{)) inhibits activation of a zymogen form of a protease domain of 

an MTSP20; 

5 (ii) inhibits activity of catalytically active form (i.e. a mutll-chain or 

single-chain form of a protease domain); and 

(iii) inhibits dimerization of the protein. 
The forms can be full length or truncated forms, including but not limited to, a 
protease domains resulting from cleavage at one or both of the activation 
10 cleavage sites; or from expression of a protease domain or catalytically active 
portions thereof. 

Methods for identifying compounds that bind to or interact with the 
MTSP20, particularly, one or both of a protease domains thereof, are provided. 
The identified compounds are candidates or leads for identification of 

1 5 compounds for treatments of tumors and other disorders and diseases involving 
aberrant angtogenesis and/or loss of appropriate growth regulation. 

Pharmaceutical composition containing a protease domain and/or full- 
length or other domain of an MTSP20 polypeptide are provided herein in a 
pharmaceutically acceptable carrier or excipient are provided herein. 

20 Also provided are articles of manufacture that contain MTSP20 

polypeptides and/or a protease domain or protease domains of an MTSP20 in 
single chain forms or activated forms. The articles contain a) packaging 
material; b) the polypeptide (or encoding nucleic acid), particularly the single 
chain protease domain thereof; and c) a label indicating that the article is for 

25 using in assays for identifying modulators of the activities of an MTSP20 
polypeptide. 

Conjugates containing a) an MTSP20 polypeptide or protease domain In a 
single or multi-chain form; and bl a targeting agent linked to the MTSP directly or 
via a linker, wherein the agent facilitates: i) affinity isolation or purification of the 
30 conjugate; ii) attachment of the conjugate to a surface; iii) detection of the 
conjugate; or iv) targeted delivery to a selected tissue or cell, are provided 
herein. The conjugate can contain a plurality of agents linked thereto. The 



wo 03/004681 



PCT/US02/21208 



-18- 

conjugate can be a chemical conjugate; and it can be a fusion protein. The 
targeting agent can be a protein or peptide fragment. The protein or peptide 
fragment can include a protein binding sequence, a nucleic acid binding 
sequence, a lipid binding sequence, a polysaccharide binding sequence, or a 
5 metal binding sequence. 

Combinations, kits and articles of manufacture containing the MTSP20 
polypeptides, domains thereof, or encoding nucleic acids are also provided 
herein. For example, combinations are provided herein. The combination can 
include: a) an Inhibitor of an activity of an MTSP20; and b) an anti-cancer 

10 treatment or agent. The MTSP inhibitor and the anti-cancer agent can be 
formulated in a single pharmaceutical composition or each is formulated in a 
separate pharmaceutical composition. The MTSP20 inhibitor can be an antibody 
or a fragment or binding portion thereof made against the IVITSP20, such as an 
antibody that specifically binds to a protease domain, an inhibitor of MTSP20 

15 production, or an inhibitor of MTSP20 membrane-localization or an Inhibitor of 
MTSP20 activation. Other MTSP20 inhibitors include, but are not limited to, an 
antisense nucleic acid or double-stranded RNA {dsRNAK such as RNAi, encoding 
the MTSP20, particularly a portion of a protease domain; a nucleic acid encoding 
at least a portion of a gene, encoding the MTSP20 with a heterologous nucleotide 

20 sequence Inserted therein such that the heterologous sequence inactivates the 
biological activity of MTSP20 or the gene encoding it. For example, the portion 
of the gene encoding the MTSP20 can flank the heterologous sequence to 
promote homologous recombination with a genomic gene encoding the MTSP20. 
In another embodiment, combinations are provided herein that include an 

25 inhibitor of an MTSP20 or an inhibitor of a protease activity thereof, and another 
anti-angiogenic treatment or agent. The MTSP20 inhibitor and the anti- 
angiogenic agent can be formulated in a single pharmaceutical composition or 
each can be formulated in a separate pharmaceutical composition. Kits 
containing the combinations are provided. 

30 Also provided are transgenic non-human animals bearing inactivated 

genes encoding the MTSP and bearing the genes encoding the MTSP20 under 
non-native promoter control are provided. Such animals are useful in animal 
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models of tumor initiation, growth and/or progression models. Transgenic non- 
human animals containing heterolgous nucleic acid MTSP20 under native, non- 
nattve promoter control or on an exogenous element, such as a plasmid or 
artificial chromosome, are additionally provided herein. In particular, 
5 recombinant non-human animals are provided herein, where the gene of an 
MTSP20 is under control of a promoter that is not the native promoter of the 
gene or that is not the native promoter of the gene in the non-human animal or 
where the nucleic acid encoding the MTSP20 is heterologous to the non-human 
animal and the promoter is the native or a non-native promoter or the MTSP20 Is 

10 on an extrachromosomal element, such as a plasmid or artificial chromosome. 
Recombinant and transgenic animals can be produced by homologous 
recombination and non-homologous recombination methods. 

Methods of gene therapy are provided. Such methods can be effected by 
administering in vivo or ex vivo an inactivating form of the MTSP20 or by 

1 5 administering an MTSP-encoding nucleic acid molecule. 

Also provided are methods for treating or preventing a tumor or cancer in 
a mammal by administering to a mammal an effective amount of an inhibitor of 
an MTSP20, whereby the tumor or cancer is treated or prevented. The MTSP20 
inhibitor used in treatment or for prophylaxis is administered with a 

20 pharmaceutically acceptable carrier or excipient. The mammal treated can be a 
human. The treatment or prevention method can additionally include 
administering an anti-cancer treatment or agent simultaneously with, subsequent 
to, or before administration of the MTSP20 inhibitor. 

Also provided are methods of treatment of tumors by administering a 

25 prodrug that is activated by MTSP20 that is expressed or active in tumor cells 
and/or angiogenic endothelial cells, particularly those in which its functional 
activity in tumor cells is greater than in non-tumor cells or in which its functional 
activity in angiogenic endothelial is greater than in quiescent endothelial cells. 
The prodrug is administered and, upon administration, active MTSP20 expressed 

30 on cells cleaves the prodrug and releases active drug in the vicinity of the tumor 
cells. The active anti-cancer drug accumulates in the vicinity of the tumor. This 
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is useful since the MTSP20 is expressed or active in tumor cells and/or 
angiogenic endothelial cells and also is useful where the MTSP20 is expressed or 
active in greater quantity, higher level or predonninantly in tumor cells and/or 
angiogenic endothelial cells compared with other cells. 
5 Also provided are methods of diagnosing the presence of a pre-malignant 

lesion, a malignancy, or other pathologic condition in a subject, by obtaining a 
biological sample from the subject; and exposing it to a detectable agent that 
binds to a multi-chain and/or single-chain form of MTSP20. where the 
pathological condition is characterized by the presence or absence of the multi- 

10 chain and/or single-chain form. Prognostic, diagnostic and therapeutic 
screening methods for preventing or treating, or for finding agents useful in 
preventing or treating, diseases or disorders associated with an aberrant level of 
angiogenesis and/or cell proliferation also are provided. 

Methods for screening for compounds' that modulate an activity of the 

15 MTSP20 are provided. In vitro assays in which the candidate compounds are 
identified by contacting them with the MTSP20 or a protease domain thereof 
and a substrate for the MTSP20 are provided. A change in the amount of 
substrate cleaved in the presence of the compounds compared to the absence of 
a candidate compound indicates that the compound modulates an activity of an 

20 MTSP20 or a protease domain or domains thereof. Such candidate compounds 
are selected for further analyses or for use to modulate an activity of an 
MTSP20, such as, as inhibitors or agonists. The in vitro assays can be 
performed In liquid phase or on solid phase substrates by linking the MTSP20 or 
protease domain thereof directly or via a linker to a solid support. Cell-based 

25 screening assays are also provided. The compounds can also be identified by 
contacting the substrates with a cell that expresses the MTSP20 or a 
proteolytically active portion thereof. 

In one embodiment, the method for identifying a modulator of an activity 
of an MTSP20 includes: a) contacting an MTSP20 with a substrate therefor, and 

30 detecting the proteolysis of the substrate, whereby an activity of the MTSP20 is 
assessed; b) contacting the I\/ITSP20 with a substrate thereof in the presence of 
a test substance, and detecting the proteolysis of the substrate, whereby an 
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activity of the MTSP20 is assessed; and c) comparing an activity of the MTSP20 
assessed in steps a) and b), whereby a difference in activity measured in step a) 
from an activity measured in step b) indicates that the test substance modulates 
an activity of an MTSP20. A plurality of the test substances can be screened 
5 simultaneously In the above screening method. 

In another embodiment, the MTSP20 to be screened is isolated from a 
target cell and the test substance is a therapeutic compound. A difference in the 
MTSP20 activity measured in the presence and in the absence of the test 
substance indicates that the target cell responds to the therapeutic compound. 

10 Diagnostic methods are also provided- For example, detection of 

protease domain(s) in the blood or other body fluid can be indicative of cancer, 
particularly metastatic cancer. 

Methods of inhibiting tumor invasion or metastasis or treating a malignant 
or pre-malignant condition by administering an agent that inhibits activation of a 

15 zymogen form of MTSP20 of a protease domain of an MTSP20 or an activity of 
an activated form are provided. The conditions include, but are not limited to, a 
condition, such as a tumor, of the breast, cervix, prostate, lung, ovary or colon. 
DETAILED DESCRIPTION 
A. Definitions 

20 Unless defined otherwise, all technical and scientific terms used herein 

have the same meaning as is commonly understood by one of skill In the art to 
which the invention(s) belong. All patents, patent applications, published 
applications and publications, Genbank sequences, websites and other published 
materials referred to throughout the entire disclosure herein, unless noted 

25 otherwise, are incorporated by reference in their entirety. In the event that there 
are a plurality of definitions for terms herein, those in this section prevail. 
Where reference is made to a URL or other such indentifier or address, it 
understood that such identifiers can change and particular information on the 
internet can come and go, but equivalent information can be found by searching 

30 the internet. Reference thereto evidences the availability and public 
dissemination of such information. 
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As used herein, the abbreviations for any protective groups, amino acids 
and other compounds, are, unless indicated otherwise, in accord with their 
common usage, recognized abbreviations, or the lUPAC-lUB Commission on 
Biochemical Nomenclature (see, (1972) Biochem. //:942-944). 
5 As used herein, serine protease refers to a diverse family of proteases 

wherein a serine residue is involved in the hydrolysis of proteins or peptides. 
The serine residue can be part of the catalytic triad mechanism, which includes a 
serine, a histidine and an aspartic acid in the catalysis, or be part of the 
hydroxyl/f-amine or hydroxyl/a-amine catalytic dyad mechanism, which involves 

10 a serine and a lysine in the catalysis. Of particular interest are serine proteases 
(SPs) of mammalian, including human, origin. Those of skill in this art recognize 
that, in general, single amino acid substitutions in non-essential regions of a 
polypeptide do not substantially alter biological activity (see, e.g., Watson et al. 
(1987) Molecular Biology of the Gene. 4th Edition, The Benjamin/Cummings Pub. 

15 CO., p.224). 

As used herein, "transmembrane serine protease (MTSP)** refers to a 
family of transmembrane serine proteases that share common structural features 
as described herein (see, also Hooper et al. (2001) J. Biol. Chem.276:&57-S60), 
Thus, reference, for example, to *'MTSP" encompasses all proteins encoded by 

20 the MTSP gene family, including but are not limited to; MTSP3, MTSP4, 

MTSP6, MTSP7 or an equivalent molecule obtained from any other source or 
that has been prepared synthetically or that exhibits the same activity. Other 
MTSPs include, but are not limited to, corin, enteropeptidase, human airway 
trypsin-like protease (HAT), MTSP1, TMPRSS2 and TMPRSS4. Sequences of 

25 encoding nucleic acid molecules and the encoded amino acid sequences of 
exemplary MTSPs and/or domains thereof are set forth, for example in U.S. 
application Serial No. 09/776,191 (SEQ ID Nos. 1-12, 49, 50 and 61-72 therein, 
published as International PCT application No. WO 01/57194). The term also 
encompass MTSPs with amino acid substitutions that do not substantially alter 

30 activity of each member and also encompasses splice variants thereof. Suitable 
substitutions, including, although not necessarily, conservative substitutions of 
amino acids, are known to those of skill in this art and can be made without 
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eliminating the biological activity, such as the catalytic activity, of the resulting 
molecule. 

As used herein an MTSP20, whenever referenced herein, includes at 

least one or all of or any combination of: 
5 a polypeptide encoded by the sequence of nucleotides set forth in 

SEQ ID No. 5 or SEQ ID No. 15 or by a sequence of nucleotides that includes 

nucleotides that encode the sequence of amino acids set forth in SEQ ID No. 6 

or SEQ ID No. 16; 

a polypeptide encoded by a sequence of nucleotides that 
10 hybridizes under conditions of low, moderate or high stringency to the sequence 

of nucleotides set forth In is set forth as SEQ ID No. 5 or SEQ ID No. 1 5; 

a polypeptide that includes the sequence of amino acids set forth 

in SEQ ID No. 6 or SEQ ID No. 16 or a catalytically active portion thereof, 

including PD1 and/or PD2 and catalytically active portions thereof; 
15 a polypeptide that includes a sequence of amino acids having at 

least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 

88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 

sequence identity with the sequence of amino acids set forth in SEQ ID No. 6 or 

SEQ ID No. 16; and/or 
20 a polypeptide encoded by a splice variant of the MTSP20 that 

includes the sequence of amino acids set forth in SEQ ID No. 6 or SEQ ID No. 

16, particularly residues that include residues 624-642 (or a portion thereof) of 

SEQ ID No. 16. 

In particular, MTSP20 polypeptides, with protease domains as indicated 
25 in SEQ ID Nos. 5, 6, 15 and 16 are provided. The polypeptide is a single or 
multi-chain polypeptide. Each protease domain can form a two chain 
polypeptide upon activation cleavage and can be employed as a portion of the 
full length molecule or as a full-length molecule. The full-length molecule can be 
two chains, three chains or. If there are additional cleavages, more chains. 
30 Where there are two chains, the shortest chain (most N-terminal chain) is called 
the A chain, the next chain the B chain and the third chain the C chain {when 
both protease domains are activated). Where a single protease domain or 
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fragment thereof or longer form thereof activated and isolated, the two chains 
are referred to as the A chain and B chain, respectively. 

Smaller portions thereof that retain protease activity are also provided. 
Protease domains from MTSPs vary in size and constitution, including insertions 
5 and deletions in surface loops. They retain conserved structure, including at 
least one of the active site triad, primary specificity pocket, oxyanion hole and/or 
other features of serine protease domains of proteases. Thus, for purposes 
herein, a protease domain is a portion of an f^TSP, as defined herein, and is 
homologous to a domain of other MTSPs, such as corin, enteropeptldase, human 

10 airway trypsin-like protease (HAT), MTSP1, TMPRSS2, and TMPRSS4, which 

have been previously identified; it was not recognized, however, that an isolated 
single chain form of a protease domain could function proteolytically in in vitro 
assays. As with the larger class of enzymes of the chymotrypsin (31) fold (see, 
e,g., Internet accessible MEROPS data base), the MTSPs protease domains share 

15 a high degree of amino acid sequence identity. The His, Asp and Ser residues 
necessary for activity are present in conserved motifs. The activation site, 
which results in the N-terminus of the second chain in a multi-chain form is 
located in a conserved motif and/or location and can be identified. In the 
exemplified MTSP20, there are two activation cleavage sites and two protease 

20 domains. 

The MTSP20 can be from any animal, particularly a mammal, and 
includes but are not limited to, primates, including humans, rodents, fowl, 
ruminants, pigs and other animals. The full-length zymogen or multi-chain 
activated forms are contemplated or any domain thereof, including protease 

25 domains, which can be two-chain activated forms, or a single chain forms. 

As used herein, a "protease domain of an MTSP" refers to an extracellular 
protease domain of an MTSP that exhibits proteolytic activity and shares 
homology and structural features with the chymotrypsin/trypsin family protease 
domains. Hence it is at least the minimal portion of the domain that exhibits 

30 proteolytic activity as assessed by standard in vitro assays. Contemplated 

herein are such protease domains and catalytically active portions thereof. Also 
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provided are truncated forms of a protease domain that Include the smallest 
fragment thereof that acts catalytically as a single chain form. 

A protease domain of an MTSP20, whenever referenced herein, includes 
at least one or all of or any combination of or a catalytically active portion of: 
5 a) a polypeptide that includes the sequence of amino acids set 

forth in SEQ ID No. 6 and those in which residue 247 is lie as the only MTSP20 
residues or a catalytically active portion thereof; 

b) a polypeptide that includes the sequence of amino acids set 
forth as residues 376-642 in SEQ ID No. 16; 
10 c) a polypeptide encoded by a sequence of nucleotides that 

hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 5 or to SEQ ID No. 15 or to the sequence 
of amino acids that encodes at least residues 624-642 or SEO ID No. 16; 

d) a polypeptide that includes the sequence of amino acids set 

15 forth in SEQ ID No. 6 and at least residues 376-624, generally 376-642, of SEQ 
ID No. 1 6 or a catalytically active portion thereof; 

e) a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 

20 sequence identity with the sequence of amino acids of a)-d); and/or 

f) a protease domain of a polypeptide encoded by a splice variant 
of a sequence of nucleotides that encodes an MTSP20 of any of a)-e). 

Protease domains of MTSPs vary in size and constitution, including 
insertions and deletions in surface loops. They retain conserved structure, 

25 including at least one of the active site triad, primary specificity pocket, 

oxyanion hole and/or other features of serine protease domains of proteases. 
Thus, for purposes herein, a protease domain is a portion of an MTSP, as 
defined herein, and is homologous to a domain of other MTSP. As with the 
larger class of enzymes of the chymotrypsin (SI) fold (see, e.g., Internet 

30 accessible MEROPS data base), the MTSP protease domains share a high degree 
of amino acid sequence identity. The His, Asp and Ser residues necessary for 
activity are present in conserved motifs. The activation site, whose cleavage 
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creates the N-terminus of a protease domain in multi-chain forms is located in a 
conserved motif and/or location and readily can be identified. 

By active form is meant a form active in vivo and/or in vitro. As 
described herein, a protease domain also can exist as a two-chain or multi-chain 
5 form. At least in vitro, the single chain forms of the SPs and the catalytic 
domains or proteolytically active portions thereof (typically C-terminal 
truncations) exhibit protease activity. Hence provided herein are isolated single 
chain forms of protease domains of MTSP20s and their use in in vitro drug 
screening assays for identification of agents that modulate an activity thereof. 
10 As used herein, the catalytically active domain of an MTSP refers to a 

protease domain. Reference to a protease domain of an MTSP generally refers 
to a single chain form of the domain. If a multi-chain form is intended, it is so- 
specified. 

As used herein, reference to "A" chain, "B" chain and "C" chain 

15 refers to the multiple chains in an MTPS upon activation cleavage of one or more 
protease domain. The full-length molecule can be two chains, three chains or, if 
there are additional cleavages, more chains. Where there are two chains, the 
shortest chain (most N-terminal chain) is called the A chain, the next chain the B 
chain and the third chain the C chain (when both protease domains are 

20 activated). Where a single protease domain or fragment thereof or longer form 
thereof activated and isolated, the two chains are referred to as the A chain and 
B chain, respectively. 

As used herein, activation cleavage refers to the cleavage of the protease 
at an N-terminus of a protease domain. By virtue of the Cys-Cys pairing 

25 between a Cys outside a protease domain and a Cys in the protease domain (in 
this instance Cysjig and Cys47T SEQ ID No. 16), upon cleavage of one activation 
site the resulting polypeptide has two chains ("A" chain and the "B" chain, 
which is the protease domain); upon cleavage of both domains the resulting 
polpeptide can have three chains (an A, B and C chain). Cleavage can be 

30 effected by another protease or autocatalytically. 

As used herein, a two-chain form of a protease domain refers to a two- 
chain form that is formed upon activation cleavage of one cleavage site in the 
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protease in which there is Cys pairing between Cys outside a protease domain 
and an unpaired Cys in a protease domain, which links a protease domain to a 
remaining N-terminal portion of the polypeptide. It also includes forms in which 
the "remaining protion of the polypeptide", is shortened and includes the Cys 
5 outside a protease domain. For example, in PD1, cysteine (C219) in the protease 
domain is predicted to pair with Cioa outside the protease domain, so that a two 
chain form includes C,o3 up to and include the N-terminus of the MTSP20. In 
PD2, the unpaired cysteine IC477) in the protease domain is predicted to pair with 
C37V so that a two chain form includes C,o3 up to and include the N-terminus of 

10 the MTSP2Q. If both protease domains are included, there resulting polypeptide 
contains three chains and includes up to the N-terminus of the MTSP20. N- 
terminal and C-terminal truncated forms are contemplated, such that the 
resulting polypeptide exhibits proteolytic activity. Also contemplated are forms 
that include only one of protease domains, in which case the N-terminus of the 

15 PD2 containing polypeptide will not necessarily be coincident with the N- 

terminus of the polypeptide, but will be up to the N-terminus of the polypeptide 
prior to activation cleavage. 

MTSP20s of interest include those that are activated, active and/or 
expressed In tumor cells (or angiogenic endothelial cells) different from, typically 

20 higher, those in non-tumor (or endothelial) cells; and those from cells in which 
substrates therefor differ in tumor cells from non-tumor cells or differ with 
respect to the substrates, co-factors or receptors, or otherwise alter an activity 
or specificity of an MTSP20. 

As used herein, a human protein is one encoded by nucleic acid, such as 

25 DNA, present in the genome of a human, including all allelic variants and 

conservative variations as long as they are not variants found in other mammals. 

As used herein, a "nucleic acid encoding a protease domain or 
catalytically active portion of a SP*' shall be construed as referring to a nucleic 
acid encoding only the recited single chain protease domain or active portion 

30 thereof, and not the other contiguous portions of the SP as a continuous 
sequence. 
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As used herein, catalytic activity refers to an activity of the MTSP as a 
protease. Function of the MTSP refers to its function in tumor biology, including 
promotion of or involvement in initiation, growth or progression of tumors, and 
also roles in signal transduction. Catalytic activity refers to the activity of an 
5 MPSP as a protease as assessed in in vitro proteolytic assays that detect 
proteolysis of a selected substrate. 

As used herein, a CUB domain is a motif that mediates protein-protein 
interactions in complement components C1r/C1s and has also been identified in 
various proteins involved in developmental processes. 
10 As used herein, "LDLR" refers to a low density lipoprotein receptor 

domain, which mediates binding to an LDL receptor. 

As used herein, a zymogen is an inactive precursor of a proteolytic 
enzyme or proteolytic domain thereo. Such precursors are generally larger, 
although not necessarily larger than the active form. With reference to serine 
1 5 proteases, zymogens are converted to active enzymes by specific cleavage, 

including catalytic and autocatalytic cleavage, or by binding of an activating co- 
factor, which generates an active enzyme. A zymogen, thus, is an enzymatically 
inactive protein that is converted to a proteolytic enzyme by the action of an 
activator. 

20 As used herein, "disease or disorder" refers to a pathological condition in 

an organism resulting from, e.g., infection or genetic defect, and characterized 

by identifiable symptoms. 

As used herein, neoplasm (neoplasia) refers to abnormal new growth, and 

thus means the same as tumor, which can be benign or malignant. Unlike 
25 hyperplasia, neoplastic proliferation persists even in the absence of the original 

stimulus. 

As used herein, neoplastic disease refers to any disorder involving cancer, 
including tumor development, growth, metastasis and progression. 

As used herein, cancer refers to a general term for diseases caused by 
30 any type of malignant tumor. 
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As used herein, malignant, as applies to tumors, refers to primary tumors 
that have the capacity of metastasis with loss of growth contro/ and positional 
control. 

As used herein, an anti-cancer agent (used interchangeably with "antl- 
5 tumor or anti-neoplastic agent") refers to any agents used in the anti-cancer 
treatment. These include any agents, when used alone or in combination with 
other compounds, that can alleviate, reduce, ameliorate, prevent, or place or 
maintain in a state of remission of clinical symptoms or diagnostic markers 
associated with neoplastic disease, tumor and cancer, and can be used in 

10 methods, combinations and compositions provided herein. Non-limiting 

examples of anti-neoplastic agents include antt-angiogenic agents, alkylating 
agents, antimetabolites, certain natural products, platinum coordination 
complexes, anthracenediones, substituted ureas, methylhydrazine derivatives, 
adrenocortical suppressants, certain hormones, antagonists and anti-cancer 

15 polysaccharides. 

As used herein, a splice variant refers to a variant produced by differential 
processing of a primary transcript of genomic nucleic acid, such as DNA, that 
results in more than one type of mRNA. Splice variants of SPs are provided 
herein. 

20 As used herein, endotheliase refers to a mammalian protein, including a 

human protein, that has a transmembrane domain and is expressed on the 
surface of endothelial cells and includes a protease domain, particularly an 
extracellular protease domain, that is typically a serine protease. Thus, 
reference, for example, to endotheliase encompasses all proteins encoded by the 

25 endotheliase gene family, or an equivalent molecule obtained from any other 

source or that has been prepared synthetically or that exhibits the same activity. 
The endotheliase gene family are transmembrane proteases expressed in 
endothelial cells. These proteases include serine proteases. When more 
particularity is need, with respect to the protein provided h'erien, it refers to 

30 proteins that have these features and also include a protease domain that 
exhibits sequence homology to a protease domain of the endotheliase 
exemplified herein. Sequence homology means sequence identity along its 
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length when aligned to maximize identity of at least about 25%, 40%, 60%, 
80%, 90% or greater number of residues. Sequence homology also is assessed 
by determining whether the encoding sequences of nucleic acids hybridize under 
conditions of at least moderate, or for more closely related proteins, high 
5 stringency to the nucleic acid molecules provided herein or to those that encode 
the same proteins but differ in sequence by virtue of the degeneracy of the 
genetic code. In addition, endotheliases encompass endotheliases with amino 
acid substitutions, including those set forth in Table 1 , that do not substantially 
alter its proteolytic activity. Suitable substitutions of amino acids are known to 

10 those of skill in this art and may be made generally without altering the biological 
activity of the resulting molecule. Those of skill in this art recognize that, in 
general, single amino acid substitutions in non-essential regions of a polypeptide 
do not substantially alter biological activity (see, e.g. . Watson et aL Motecular 
Btoiogy of the Gene, 4th Edition, 1987, The Benjamin/Cummings Pub. co., 

15 p. 224). Also Included within the definition, is the catalytically active fragment 
of an endotheliase. 

As used herein, a protease domain of an endotheliase refers to a 
polypeptide portion of the endotheliase that exhibits protease activity. A 
protease domain is a polypeptide that includes at least the minimum number of 

20 amino acids, generally more than 50 or 100, required for protease activity. 
Protease activity may be assessed empirically, such as by testing the 
polypeptide for its ability to act as a protease. Assays, such as in the assays 
described in the EXAMPLES may be used. Furthermore, since proteases, 
particularly serine proteases, have characteristic structures and sequences or 

25 motifs, a protease domain can readily identified by such structure and sequence 
or motif. 

As used herein, a catalytically active portion of a protease domain of 
endotheliase refers to a protease domain or portion thereof that exhibits serine 
proteolytic activity. Hence it is at least the minimal portion or portions where, as 
30 in this intance, there are a plurality of protease domains, of an extracellular 
domain that exhibits proteolytic activity as assessed by standard assays. 
Smaller portions thereof that retain protease activity are contemplated. Protease 
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domains vary in size and constitution, including insertions and deletions rn 
surface loops. Such domains exhibit conserved structure, including at least one 
structural feature, such as the active site triad, primary specificity pocket, 
oxyanion hole and/or other features of serine protease domains of proteases. 
5 Thus, for purposes herein, a protease domain is a portion of an endotheliase, as 
defined herein, that is homologous in terms of structural features and retention 
of sequence of similarity or homology the protease domain of chymotrypsin or 
trypsin. 

As used herein, angiogenesis is intended to broadly encompass the 
1 0 totality of processes directly or indirectly involved in the establishment and 

maintenance of new vasculature (neovascularization), including, but not limited 

to, neovascularization associated with tumors. 

As used herein, anti-angiogenic treatment or agent refers to any 

therapeutic regimen and compound, when used alone or in combination with 
1 5 other treatment or compounds, that can alleviate, reduce, ameliorate, prevent, or 

place or maintain in a state of remission of clinical symptoms or diagnostic 

markers associated with undesired and/or uncontrolled angiogenesis. Thus, for 

purposes herein an anti-angiogenic agent refers to an agent that inhibits the 

establishment or maintenance of vasculature. Such agents include, but are not 
20 limited to, anti-tumor agents, and agents for treatments of other disorders 

associated with undesirable angiogenesis, such as diabetic retinopathies, 

restenosis, hyperproliferative disorders and others. 

As used herein, non-anti-angiogenic anti-tumor agents refer to anti-tumor 

agents that do not act primarily by inhibiting angiogenesis. 
25 As used herein, pro-angiogenic agents are agents that promote the 

establishment or maintenance of the vasculature. Such agents include agents 

for treating cardiovascular disorders, including heart attacks and strokes. 

As used herein, undesired and/or uncontrolled angiogenesis refers to 

pathological angiogenesis wherein the influence of angiogenesis stimulators 
30 outweighs the influence of angiogenesis inhibitors. As used herein, deficient 

angiogenesis refers to pathological angiogenesis associated with disorders where 
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there is a defect in normal angiogenesis resulting in aberrant angiogenesis or an 
absence or substantial reduction in angiogenesis. 

As used herein, a protease domain oi an SP protein refers to a domain of 
an SP that exhibits proteolytic activitv- Hence it is at least a minimal portion of 
5 the protein that exhibits proteolytic activity as assessed by standard assays in 
vitro. It refers, herein, to a single chain form and also the multi-chain activated 
forms. Exemplary protease domains include at least a sufficient portion of 
sequences of amino acids set forth in SEQ ID No. 6 (encoded by nucleotides in 
SEQ ID No. 5) to exhibit protease activity. 

10 Also contemplated are nucleic acid molecules that encode a polypeptide 

that has proteolytic activity in an in vitro proteolysis assay and that have at least 
60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence 
identity with the full-length or a protease domain of an MTSP20 polypeptide or 

1 5 other domain thereof, or that hybridize along their full-length or along at least 
about 70%, 80% or 90% of the full-length to a nucleic acids that encode a 
protease domain or other domain, particularly under conditions of moderate, 
generally high, stringency. 

For protease domains, residues at the N-terminus can be critical for 

20 activity. It is shown herein that a protease domain of the single chain form of 
the MTSP20 protease is catalytically active. Hence, a protease domain generally 
requires the N-terminal amino acids thereof for activity; the C-terminus portion 
can be truncated. The amount that can be removed can be determined 
empirically by testing the polypeptide for protease activity in an in vitro assay 

25 that assesses catalytic cleavage. 

Hence smaller portions of protease domains, particularly single chain 
domains, thereof that retain protease activity are contemplated. Such smaller 
versions generally are C-terminal truncated versions of the protease domains. 
Such domains exhibit conserved structure, including at least one structural 

30 feature, such as the active site triad, primary specificity pocket, oxyanion hole 
and/or other features of serine protease domains of proteases. Thus, for 
purposes herein, a protease domain is a single chain portion of an MTSP20 
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domain, as defined herein, but is homologous in its structural features and 
retention of sequence of similarity or homology to the protease domain of 
chymotrypsin or trypsin. A protease domain polypeptide can exhibit proteolytic 
activity as a single chain. 
5 As used herein, "homologous" means about greater than 25% nucleic 

acid sequence identity, such as 25% 40%. 60%, 70%, 80%, 90% or 95%. If 
necessary the percentage homology will be specified. The terms "homology" 
and "identity" are often used interchangeably. In general, sequences are aligned 
so that the highest order match is obtained {see, e.g. : Computational Molecular 

10 Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 

Blocomputing: Informatics and Genome Projects, Smith, D-W., ed., Academic 
Press, New York, 1993; Computer Analysis of Sequence Data, Parti. Griffin, 
A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and 

15 Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991; Carillo et al. {^SQZ) SIAM J Applied Math 48'A012). 
By sequence identity, the number of conserved amino acids are determined by 
standard alignment algorithms programs, and are used with default gap penalties 
established by each supplier. Substantially homologous nucleic acid molecules 

20 would hybridize typically at moderate stringency or at high stringency all along 
the length of the nucleic acid or along at least about 70%, 80% or 90% of the 
full-length nucleic acid molecule of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the hybridizing 
nucleic acid molecule. 

25 Whether any two nucleic acid molecules have nucleotide sequences that 

are at least, for example, 80%, 85%, 90%, 95%, 96%, 97%. 98% or 99% 
"identical" can be determined using known computer algorithms such as the 
"FAST A" program, using for example, the default parameters as in Pearson et 
al. (1988) Proc. Nati Acad. Sci. USA ^5:2444 (other programs include the GCG 

30 program package (Devereux, J., et al.. Nucleic Acids Research 12fl}:3&7 

(1984)), BLASTP, BLASTN, FASTA (Atschul, S.F., et al., J Molec Biol 215:403 
(1990); Guide to Huge Computers, Martin J. Bishop, ed.. Academic Press, San 
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Diego. 1994, and Carilto ef a/. (1988) S!AM J Applied Math For 
example, the BLAST function of the National Center for Biotechnology 
Information database can be used to determine identity. Other commercially or 
publicly available programs include, DNAStar "MegAlign" program (Madison, Wl) 
5 and the University of Wisconsin Genetics Computer Group (UWG) "Gap" 

program (Madison Wl)). Percent homology or identity of proteins and/or nucleic 
acid molecules can be determined, for example, by comparing sequence 
information using a GAP computer program {e.g., Needleman ef ai (1970) J, 
Moi Biol. 48:443, as revised by Smith and Waterman ((1981) Adv. AppL Math. 

10 2:482). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are simitar, divided by the total 
number of symbols in the shorter of the two sequences. Default parameters for 
the GAP program can include: (1) a unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix 

15 of Gribskov et ai. (1986) Nud. Acids Res. 14:6745, as described by Schwartz 
and Dayhoff , eds., A TLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 
each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
penalty for end gaps. Therefore, as used herein, the term "identity" represents a 

20 comparison between a test and a reference polypeptide or polynucleotide. 

As used herein, the term "at least 90% identical to" refers to percent 
identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 100 amino acids are 

25 compared, no more than 10% (i.e., 10 out of 100) of amino acids in the test 

polypeptide differs from that of the reference polypeptides. Similar comparisons 
can be made between a test and reference polynucleotides. Such differences 
can be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they can be clustered in one or more 

30 locations of varying length up to the maximum allowable, e.g. 10/100 amino 

acid difference (approximately 90% identity). Differences are defined as nucleic 
acid or amino acid substitutions, or deletions. At the level of homologies or 
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identities above about 85-90%, the result should be independent of the program 
and gap paranrieters set; such high levels of identity can be assessed readily, 
often without relying on software. 

As used herein, primer refers to an oligonucleotide containing two or 
5 more deoxyribonucleotides or ribonucleotides, typically more than three, from 
which synthesis of a primer extension product can be initiated. Experimental 
conditions conducive to synthesis include the presence of nucleoside 
triphosphates and an agent for polymerization and extension, such as DNA 
polymerase, and a suitable buffer, temperature and pH. 

10 As used herein, animals include any animal, such as, but not limited to, 

goats, cows, deer, sheep, rodents, pigs and humans. Non-human animals 
exclude humans as the contemplated animal. The SPs provided herein are from 
any source, animal, plant, prokaryotic and fungal. Most MTSP20s are of animal 
origin, including mammalian origin. 

15 As used herein, genetic therapy involves the transfer of heterologous 

nucleic acid, such as DNA, into certain cells, target cells, of a mammal, 
particularly a human, with a disorder or conditions for which such therapy is 
sought. The nucleic acid, such as DNA, is introduced into the selected target 
cells in a manner such that the heterologous nucleic acid, such as DNA, is 

20 expressed and a therapeutic product encoded thereby is produced. 

Alternatively, the heterologous nucleic acid, such as DNA, can in some manner 
mediate expression of DNA that encodes the therapeutic product, or it can 
encode a product, such as a peptide or RNA that In some manner mediates, 
directly or indirectly, expression of a therapeutic product. Genetic therapy can 

25 also be used to deliver nucleic acid encoding a gene product that replaces a 
defective gene or supplements a gene product produced by the mammal or the 
cell in which it is Introduced. The introduced nucleic acid can encode a 
therapeutic compound, such as a growth factor inhibitor thereof, or a tumor 
necrosis factor or inhibitor thereof, such as a receptor therefor, that is not 

30 normally produced in the mammalian host or that is not produced in 

therapeutically effective amounts or at a therapeutically useful time. The 
heterologous nucleic acid, such as DNA, encoding the therapeutic product can 
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be modified prior to introduction into the cells of the afflicted host in order to 
enhance or otherwise alter the product or expression thereof. Genetic therapy 
can also involve delivery of an inhibitor or repressor or other modulator of gene 
expression. 

5 As used herein, heterologous nucleic acid is nucleic acid that are not 

normally produced in vivo by the cell in which it is expressed or that mediates or 
encodes mediators that alter expression of endogenous nucleic acid, such as 
DNA, by affecting transcription, translation, or other regulatable biochemical 
processes. Heterologous nucleic acid, such as DNA, can also be referred to as 

10 foreign nucleic acid, such as DNA. Any nucleic acid, such as DNA, that one of 
skill in the art would recognize or consider as heterologous or foreign to the cell 
in which the nucleic acid is expressed is herein encompassed by heterologous 
nucleic acid; heterologous nucleic acid includes exogenously added nucleic acid 
that is also expressed endogenously. Examples of heterologous nucleic acid 

15 include, but are not limited to, nucleic acid that encodes traceable marker 
proteins, such as a protein that confers drug resistance, nucleic acid that 
encodes therapeutically effective substances, such as anti-cancer agents, 
enzymes and hormones, and nucleic acid, such as DNA, that encodes other 
types of proteins, such as antibodies. Antibodies that are encoded by 

20 heterologous nucleic acid can be secreted or expressed on the surface of the cell 
in which the heterologous nucleic acid has been introduced. Heterologous 
nucleic acid is generally not endogenous to the celt into which it is introduced, 
but has been obtained from another cell or prepared synthetically. Generally, 
although not necessarily, such nucleic acid encodes RNA and proteins that are 

25 not normally produced by the cell in which it is now expressed. 

As used herein, a therapeutically effective product for gene therapy is a 
product that is encoded by heterologous nucleic acid, typically DNA, that, upon 
introduction of the nucleic acid into a host, a product is expressed that 
ameliorates or eliminates the symptoms, manifestations of an inherited or 

30 acquired disease or that cures the disease. Also included are biologically active 
nucleic acid molecules, such as RNAi and antisense. 
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As used herein, recitation that a polypeptide consists essentially of a 
protease domain means that the only SP portion of the polypeptide is a protease 
domain or a catalytically active portion thereof. The polypeptide can optionally, 
and generally will, include additional non-SP-derived sequences of amino acids. 
5 As used herein, cancer or tumor treatment or agent refers to any 

therapeutic regimen and/or compound that, when used alone or in combination 
with other treatments or compounds, can alleviate, reduce, ameliorate, prevent, 
or place or maintain In a state of remission of clinical symptoms or diagnostic 
markers associated with deficient angiogenesis. 

10 As used herein, domain refers to a portion of a molecule, e.g,, a protein 

or the encoding nucleic acid, that is structurally and/or functionally distinct from 
other portions of the molecule. 

As used herein, protease refers to an enzyme catalyzing hydrolysis of 
proteins and/or peptides. It includes zymogen forms and activated forms 

1 5 thereof. For clarity, reference to protease refers to all forms, and particular 
forms will be specifically designated. For purposes herein, a protease domain 
includes single and multi-chain forms of a protease domain of an SP protein. For 
MTSP20, a protease domain also includes single and two chain forms and multi- 
chain forms of a protease domain. 

20 As used herein, nucleic acids include DNA, RNA and analogs thereof, 

including peptide nucleic acids (PNA) and mixtures thereof. Nucleic acids can be 
single or double-stranded. When referring to probes or primers, optionally 
labeled, with a detectable label, such as a fluorescent or radiolabel, single- 
stranded molecules are contemplated. Such molecules are typically of a length 

25 such that their target is statistically unique or of low copy number (typically less 
than 5, generally less than 3) for probing or priming a library. Generally a probe 
or primer contains at least 14, 16 or 30 contiguous of sequence complementary 
to or identical a gene of interest. Probes and primers can be 10, 20, 30, 50, 
100 or more nucleic acids long. 

30 As used herein, nucleic acid encoding a fragment or portion of an SP 

refers to a nucleic acid encoding only the recited fragment or portion of SP, and 
not the other contiguous portions of the SP. 
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As used herein, operative linkage of heterologous nucleic acids to 
regulatory and effector sequences of nucleotides, such as promoters, enhancers, 
transcriptional and translational stop sites, and other signal sequences refers to 
the relationship between such nucleic acid, such as DNA, and such sequences of 
5 nucleotides. Thus, operatively linked or operationally associated refers to the 
functional relationship of nucleic acid, such as DNA, with regulatory and effector 
sequences of nucleotides, such as promoters, enhancers, transcriptional and 
translational stop sites, and other signal sequences. For example, operative 
linkage of DNA to a promoter refers to the physical and functional relationship 

10 between the DNA and the promoter such that the transcription of such DNA is 
initiated from the promoter by an RNA polymerase that specifically recognizes, 
binds to and transcribes the DNA. In order to optimize expression and/or in vitro 
transcription, it can be necessary to remove, add or alter 5' untranslated portions 
of the clones to eliminate extra, potential inappropriate alternative translation 

15 initiation {i.e., start) codons or other sequences that can interfere with or reduce 
expression, either at the level of transcription or translation. Alternatively, 
consensus ribosome binding sites (see, e.g., Kozak J. Biol. Chem. 265:19867- 
19870 (1991)) can be inserted immediately 5' of the start codon and can 
enhance expression. The desirability of (or need for) such modification can be 

20 empirically determined. 

As used herein, a sequence complementary to at least a portion of an 
RNA, with reference to antisense oligonucleotides, means a sequence having 
sufficient complementariiy to be able to hybridize with the RNA, generally under 
moderate or high stringency conditions, forming a stable duplex; in the case of 

25 double-stranded SP antisense nucleic acids, a single strand of the duplex DNA 
(or dsRNA) can thus be tested, or triplex formation can be assayed. The ability 
to hybridize depends on the degree of complementariiy and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with a SP encoding RNA it can contain and still form a 

30 stable duplex (or triplex, as the case can be). One skilled in the art can ascertain 
a tolerable degree of mismatch by use of standard procedures to determine the 
melting point of the hybridized complex. 
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For purposes herein, amino acid substitutions can be made in any SPs 
and protease domains thereof provided that the resulting protein exhibits 
protease activity. Amino acid substitutions contemplated include conservative 
substitutions, such as those set forth in Table 1 , which do not eliminate 
5 proteolytic activity. As described herein, substitutions that alter properties of 
the proteins, such as removal of cleavage sites and other such sites are also 
contemplated; such substitutions are generally non-conservative, but can be 
readily effected by those of skill in the art. 

Suitable conservative substitutions of amino acids are known to those of 
10 skill in this art and can be made generally without altering the biological activity, 
for example enzymatic activity, of the resulting molecule. Those of skill in this 
art recognize that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter biological activity (see, e.g., 
Watson et a/. Molecular Biology of the Gene, 4th Edition, 1 987, The 
15 Benjamin/Cummings Pub. co., p.224). Also included within the definition, is the 
catalytically active fragment of an SP, particularly a single chain protease 
portion. Conservative amino acid substitutions are made, for example, in 
accordance with those set forth in TABLE 1 as follows: 

TABLE 1 

20 



25 



30 



35 



Original residue 


Conservative substitution 


Ala (A) 


Gly; Ser. Abu 


Arg (R) 


Lys, om 


Asn (N) 


Gin; His 


Cys (C) 


Ser 


Gin (Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His (H) 


Asn; Gin 


lie (1} 


Leu; Val; Met; NIe; Nva 


Leu (L) 


lie; Val; Met; NIe; Nv 


Lys (K) 


Arg; Gin; Glu 


Met (Ml 


Leu; Tyr; lie; NLe Val 


Ornithine 


Lys; Arg 


Phe {Fl 


Met; Leu; Tyr 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp |W) 


Tyr 


Tyr (V) 


Trp; Phe 


Val (V) 


lie; Leu; Met; NIe; Nv 
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Other substitutions are also permissible and can be determined empirically or in 
accord with known conservative substitutions. 

As used herein, Abu is 2-aminobutvrlc acid; Orn is ornithine. 
As used herein, the amino acids, which occur in the various amino acid 
5 sequences appearing herein, are identified according to their well-lcnown, three- 
letter or one-letter abbreviations. The nucleotides, which occur in the various 
DNA fragments, are designated with the standard single-letter designations used 
routinely in the art. 

As used herein, a probe or primer based on a nucleotide sequence 

10 disclosed herein, includes at least 10, 14, typically at least 16 contiguous 

sequence of nucleotides of SEQ ID No. 5 or SEQ ID No. 15, and probes of at 
least 30, 50 or 100 contiguous sequence of nucleotides of SEQ ID No. 5 or SEQ 
ID No. 15. The length of the probe or primer for unique hybridization is a 
function of the complexity of the genome of interest. 

15 As used herein, amelioration of the symptoms of a particular disorder by 

administration of a particular pharmaceutical composition refers to any lessening, 
whether permanent or temporary, lasting or transient, that can be attributed to 
or associated with administration of the composition. 

As used herein, antisense polynucleotides refer to synthetic sequences of 

20 nucleotide bases complementary to mRNA or the sense strand of double- 
stranded DNA. Admixture of sense and antisense polynucleotides under 
appropriate conditions leads to the binding of the two molecules, or 
hybridization. When these polynucleotides bind to {hybridize with) mRNA, 
inhibition of protein synthesis (translation) occurs. When these polynucleotides 

25 bind to double-stranded DNA, inhibition of RNA synthesis (transcription) occurs. 
The resulting inhibition of translation and/or transcription leads to an inhibition of 
the synthesis of the protein encoded by the sense strand. Antisense nucleic 
acid molecules typically contain a sufficient number of nucleotides to specifically 
bind to a target nucleic acid, generally at least 5 contiguous nucleotides, often at 

30 least 14 or 16 or 30 contiguous nucleotides or modified nucleotides 

complementary to the coding portion of a nucleic acid molecule that encodes a 
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gene of interest, for example, nucleic acid encoding a single chain protease 
donnain of an SP. 

As used herein, an array refers to a collection of elements, such as 
antibodies, containing three or more members. An addressable array is one in 
5 which the members of the array are identifiable, typically by position on a solid 
phase support. Hence, in general the members of the array are immobilized on 
discrete identifiable loci on the surface of a solid phase. 

As used herein, computational docking refers to techniques where 
molecules, for example, a ligand and receptor or active site, are fitted together 

10 based on complementary interactions, for example, steric, hydrophobic or 

electrostatic interactions. In a typical computational docking protocol, the active 
site, or sites deemed important for protein activity, of a protein model are 
defined. A molecular database, such as the Available Chemicals Directory (AGO) 
or any database of molecules, is screened for molecules that complement the 

15 protein model. In computational docking studies, drugs or drug candidates are 
fitted to the structural variant models based on complementary interactions {e.g., 
steric, hydrophobic, or electrostatic interactions). Methods for performing such 
studies are well known and software tools for performing the calculations are 
widely available (M. Lambert, "Docking Conformationatly Flexible Molecules into 

20 Protein Binding Sites" in Practical Application of Computer-Aided Drug Design, 
Charifson, Ed,, Marcel Dekker, NY, pp. 243-303; Kurtz 0 992) Science 
257:1078-1082; Kuntz et at. (1982) J. Mol. Biol. 161:269-288; Stewart et aL 
(1992) Med. Chem. Res. /:439-443; Shoichet et al. (1993) Science 259:1445- 
1450; Shoichet et aK (1991) J. Mo/. Bio/. 227:327-346). 

25 As used herein, antibody refers to an immunoglobulin, whether natural or 

partially or wholly synthetically produced, including any derivative thereof that 
retains the specific binding ability the antibody. Hence antibody includes any 
protein having a binding domain that is homologous or substantially homologous 
to an immunoglobulin binding domain. Antibodies include members of any 

30 immunoglobulin claims, including IgG, IgM, IgA, IgD and IgE. 

As used herein, antibody fragment refers to any derivative of an antibody 
that is less than full-length, retaining at least a portion of the full-length 
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antibody's specific binding ability. Examples of antibody fragments include.but 
are not limited to. Fab, Fab', F(ab)2/ single-chain Fvs (scFV), FV, dsFV diabody 
and Fd fragments. The fragment can include multiple chains linked together, 
such as by disulfide bridges. An antibody fragment generally contains at least 
5 about 50 amino acids and typically at least 200 amino acids. 

As used herein, an Fv antibody fragment is composed of one variable 
heavy domain (V^) and one variable light domain linked by noncovalent 
interactions. 

As used herein, a dsFV refers to an Fv with an engineered intermolecular 
1 0 disulfide bond, which stabilizes the Vh-V^ pair. 

As used herein, an F(ab)2 fragment is an antibody fragment that results 
from digestion of an immunoglobulin with pepsin at pH 4.0-4.5; it can be 
recombinantly expressed to produce the equivalent fragment. 

As used herein. Fab fragments are antibody fragments that result from 
1 5 digestion of an immunoglobulin with papain; they can be recombinantly 
expressed to produce the equivalent fragment. 

As used herein, scFVs refer to antibody fragments that contain a variable 
light chain (VJ and variable heavy chain (V^) covalently connected by a 
polypeptide linker in any order. The linker is of a length such that the two 
20 variable domains are bridged without substantial Interference. Included linkers 
are {Gly-Ser)^ residues with some Glu or Lys residues dispersed throughout to 
increase solubility. 

As used herein, humanized antibodies refer to antibodies that are 
modified to include human sequences of amino acids so that administration to a 
25 human does not provoke an immune response. Methods for preparation of such 
antibodies are known. For example, to produce such antibodies, the encoding 
nucleic acid in the hybridoma or other prokaryotic or eukaryotic cell, such as an 
E. coH or a CHO cell, that expresses the monoclonal antibody is altered by 
recombinant nucleic acid techniques to express an antibody in which the amino 
30 acid composition of the non-variable region is based on human antibodies. 

Computer programs have been designed to identify such non-variable regions. 
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As used herein, diabodies are dimeric scFV; diabodies typically have 
shorter peptide linkers than scFvs, and they generally dimerize. 

As used herein, production by recombinant means by using recombinant 
DNA methods means the use of the well known methods of molecular biology 
5 for expressing proteins encoded by cloned DNA. 

As used herein the term assessing is intended to include 
quantitative and qualitative determination in the sense of obtaining an 
absolute value for an activity of an MTSP, or a domain thereof, present in the 
sample, and also of obtaining an index, ratio, percentage, visual or other value 
10 indicative of the level of the activity. Assessment can be direct or indirect and 
the chemical species actually detected need not of course be the proteolysis 
product itself but can for example be a derivative thereof or some further 
substance. 

As used herein, biological activity refers to the in vivo activities of a 
1 5 compound or physiological responses that result upon in vivo administration of a 
compound, composition or other mixture. Biological activity, thus, encompasses 
therapeutic effects and pharmaceutical activity of such compounds, 
compositions and mixtures; Biological activities can be observed in in vitro 
systems designed to test or use such activities. Thus, for purposes herein the 
20 biological activity of a luciferase is Its oxygenase activity whereby, upon 
oxidation of a substrate, light is produced. 

As used herein, functional activity refers to a polypeptide or portion 
thereof that displays one or more activities associated with a full-length protein. 
Functional activities include, but are not limited to, biological activity, catalytic or 
25 enzymatic activity, antigenicity (ability to bind to or compete with a polypeptide 
for binding to an anti-polypeptlde antibody), immunogenicity, ability to form 
multimers, the ability to specifically bind to a receptor or ligand for the 
polypeptide. 

As used herein, a conjugate refers to the compounds provided herein that 
30 include one or more SPs, including an MTSP20, particularly single chain protease 
domains thereof, and one or more targeting agents. These conjugates include 
those produced by recombinant means as fusion proteins, those produced by 
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chemical means, such as by chemical coupling, through, for example, coupling 
to sulfhydryl groups, and those produced by any other method whereby at least 
one SP, or a domain thereof, is linked, directly or indirectly via linker(s) to a 
targeting agent. 

5 As used herein, a targeting agent is any moiety, such as a protein or 

effective portion thereof, that provides specific binding of the conjugate to a cell 
surface receptor, which, can internalize the conjugate or SP portion thereof. A 
targeting agent can also be one that promotes or facilitates, for example, affinity 
isolation or purification of the conjugate; attachment of the conjugate to a 

10 surface; or detection of the conjugate or complexes containing the conjugate. 

As used herein, an antibody conjugate refers to a conjugate in which the 
targeting agent is an antibody. 

As used herein, derivative or analog of a molecule refers to a portion 
derived from or a modified version of the molecule. 

1 5 As used herein, an effective amount of a compound for treating a 

particular disease is an amount that is sufficient to ameliorate, or in some 
manner reduce the symptoms associated with the disease. Such amount can be 
administered as a single dosage or can be administered according to a regimen, 
whereby it is effective. The amount can cure the disease but, typically, is 

20 administered in order to ameliorate the symptoms of the disease. Repeated 

administration can be required to achieve the desired amelioration of symptoms. 

As used herein, equivalent, when referring to two sequences of nucleic 
acids means that the two sequences in question encode the same sequence of 
amino acids or equivalent proteins. When equivalent is used in referring to two 

25 proteins or peptides, it means that the two proteins or peptides have 

substantially the same amino acid sequence with only amino acid substitutions 
(such as, but not limited to, conservative changes such as those set forth in 
Table 1 above) that do not substantially alter an activity or function of the 
protein or peptide. When equivalent refers to a property, the property does not 

30 need to be present to the same extent (e.g.. two peptides can exhibit different 
rates of the same type of enzymatic activity), but the activities are usually 
substantially the same. Complementary, when referring to two nucleotide 
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sequences, means that the two sequences of nucleotides are capable of 
hybridizing, typically with less than 25%, 15%, 5% or 0% mismatches between 
opposed nucleotides. If necessary the percentage of complementarity will be 
specified. Typically the two molecules are selected such that they will hybridize 
5 under conditions of high stringency. 

As used herein, an agent that modulates an activity of a polypeptide, 
protein or expression of a gene or nucleic acid, either decreases or increases or 
otherwise alters the activity of the polypeptide or protein or, in some manner up- 
or down-regulates or otherwise alters expression of the nucleic acid in a cell. 

10 As used herein, inhibitor of an activity of an f\/ITSP encompasses any 

substance that prohibits or decrease production, post-translational 
modification(s), maturation, or membrane localization of the SP or any substance 
that interferes with or decreases the proteolytic efficacy of thereof, particularly 
of a single chain form in an in vitro screening assay. 

15 As used herein, a method for treating or preventing neoplastic disease 

means that any of the symptoms, such as the tumor, metastasis thereof, the 
vascularization of the tumors or other parameters by which the disease is 
characterized are reduced, ameliorated, prevented, placed in a state of remission, 
or maintained in a state of remission. It also means that the hallmarks of 

20 neoplastic disease and metastasis can be eliminated, reduced or prevented by 
the treatment. Non-limiting examples of the hallmarks include uncontrolled 
degradation of the basement membrane and proximal extracellular matrix, 
migration, division, and organization of the endothelial cells irito new functioning 
capillaries, and the persistence of such functioning capillaries. 

25 As used herein, pharmaceutically acceptable salts, esters or other 

derivatives of the conjugates include any salts, esters or derivatives that can be 
readily prepared by those of skill in this art using known methods for such 
derivatization and that produce compounds that can be administered to animals 
or humans without substantial toxic effects and that either are pharmaceutically 

30 active or are prodrugs. 

As used herein, a prodrug is a compound that, upon in vivo 
administration, is metabolized or otherwise converted to the biologically, 



wo 03/004681 



PCT/US02/21208 



-46- 

pharmaceutically or therapeutically active form of the compound. To produce a 
prodrug, the pharmaceuticatly active compound is modified such that the active 
compound is regenerated by metabolic processes. The prodrug can be designed 
to alter the metabolic stability or the transport characteristics of a drug, to mask 
5 side effects or toxicity, to improve the flavor of a drug or to alter other 
characteristics or properties of a drug. By virtue of knowledge of 
pharmacodynamic processes and drug metabolism in vivo, those of skill in this 
art, once a pharmaceutically active compound is known, can design prodrugs of 
the compound (see, e.g., Nogrady (1985) Medicinal Chemistry A Biochemicai 

10 Approach, Oxford University Press, New York, pages 388-392). 

As used herein, a drug identified by the screening methods provided 
herein refers to any compound that is a candidate for use as a therapeutic or as 
a lead compound for the design of a therapeutic. Such compounds can be small 
molecules, including small organic molecules, peptides, peptide mimetics, 

1 5 antisense molecules or dsRNA, such as RNAi, antibodies, fragments of 

antibodies, recombinant antibodies and other such compounds that can serve as 
drug candidates or lead compounds. 

As used herein, a peptidomimetic is a compound that mimics the 
conformation and certain stereochemical features of the biologically active form 

20 of a particular peptide. In general, peptidomimetics are designed to mimic 

certain desirable properties of a compound, but not the undesirable properties, 
such as flexibility, that lead to a loss of a biologically active conformation and 
bond breakdown. Peptidomimetics may be prepared from biologically active 
compounds by replacing certain groups or bonds that contribute to the 

25 undesirable properties with biotsosteres. Bioisosteres are known to those of 

skill in the art. For example the methylene blotsostere CHjS has been used as an 
amide replacement in enkephalin analogs (see, e.g., Spatola (1983) pp. 267-357 
in Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins, 
Weistein, Ed. volume 7, Marcel Dekker, New York). Morphine, which can be 

30 administered orally, is a compound that is a peptidomimetic of the peptide 
endorphin. For purposes herein, cyclic peptides are included among 
peptidomimetics. 
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As used herein, a promoter region or promoter element refers to a 
segment of DNA or RNA tliat controls transcription of the DNA or RNA to which 
it is operatively linked. The promoter region includes specific sequences that are 
sufficient for RNA polymerase recognition, binding and transcription initiation. 
5 This portion of the promoter region is referred to as the promoter. In addition, 
the promoter region includes sequences that modulate this recognition, binding 
and transcription initiation activity of RNA polymerase. These sequences can be 
cis acting or can be responsive to trans acting factors. Promoters, depending 
upon the nature of the regulation, can be constitutive or regulated. Exemplary 
10 promoters contemplated for use in prokaryotes include the bacteriophage T7 and 
T3 promoters. 

As used herein, a receptor refers to a molecule that has an affinity for a 
given ligand. Receptors can be naturally-occurring or synthetic molecules. 
Receptors can also be referred to in the art as anti-ligands. As used herein, the 

15 receptor and anti-ligand are interchangeable. Receptors can be used in their 

unaltered state or as aggregates with other species. Receptors can be attached, 
covalently or noncovalently, or in physical contact with, a binding member, 
either directly or indirectly via a specific binding substance or linker. Examples 
of receptors, include, but are not limited to: antibodies, ceil membrane receptors 

20 surface receptors and internalizing receptors, monoclonal antibodies and antisera 
reactive with specific antigenic determinants (such as on viruses, cells, or other 
materials), drugs, polynucleotides, nucleic acids, peptides, cofactors, lectins, 
sugars, polysaccharides, cells, cellular membranes, and organelles. 

Examples of receptors and applications using such receptors, include but 

25 are not restricted to: 

a) enzymes: specific transport proteins or enzymes essential to survival 
of microorganisms, which could serve as targets for antibiotic (ligand) selection; 

b) antibodies: identification of a ligand-binding site on the antibody 
molecule that combines with the epitope of an antigen of interest can be 

30 investigated; determination of a sequence that mimics an antigenic epitope can 
lead to the development of vaccines of which the immunogen is based on one or 
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more of such sequences or lead to the development of related diagnostic agents 
or compounds useful in therapeutic treatments such as for auto-immune diseases 
c) nucleic acids: identification of tigand, such as protein or RNA, binding 

sites; 

5 d) catalytic polypeptides: polymers, including polypeptides, that are 

capable of promoting a chemical reaction involving the conversion of one or 
more reactants to one or more products; such polypeptides generally include a 
binding site specific for at least one reactant or reaction intermediate and an 
active functionality proximate to the binding site, in which the functionality is 
10 capable of chemically modifying the bound reactant {see, e.g., U.S. Patent No. 
5,215,899); 

e) hormone receptors: determination of the ligands that bind with high 
affinity to a receptor is useful In the development of hormone replacement 
therapies; for example, identification of ligands that bind to such receptors can 

1 5 lead to the development of drugs to control blood pressure; and 

f) opiate receptors: determination of ligands that bind to the opiate 
receptors in the brain is useful in the development of less-addictive replacements 
for morphine and related drugs. 

As used herein, sample refers to anything that contains an analyte for 
20 which an analyte assay is desired. The sample can be a biological sample, such 
as a biological fluid or a biological tissue. Examples of biological fluids include 
urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral spinal fluid, 
tears, mucus, sperm, amniotic fluid or the like. Biological tissues are aggregates 
of cells, usually of a particular kind together with their intercellular substance 
25 that form one of the structural materials of a human, animal, plant, bacterial, 
fungal or viral structure, including connective, epithelium, muscle and nerve 
tissues. Examples of biological tissues also include organs, tumors, lymph 
nodes, arteries and individual cell(s). 

As used herein: stringency of hybridization in determining percentage 
30 mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65^*0 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, bO^C 
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3) low Stringency: 1 .0 x SSPE, 0.1 % SDS, 50°C 
Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g., Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Cloning^ A Laboratory Manual, Cold Spring 
5 Harbor Laboratory Press (1989), vol. 3, p. B.I 3, see, also, numerous catalogs 
that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.18 NaCI. Further, those of skill in the art recognize that the stability 
of hybrids is determined by T„, which is a function of the sodium ion 
concentration and temperature (T^ ^ 81 .B"* C-ie.edogiofNa-')) 4- 0.41((%G + C)- 

10 600/1)), so that the only parameters in the wash conditions critical to hybrid 
stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows (see also 

15 Shilo and Weinberg, Proc. Natl. Acad. Sci. USA 75:6789-6792 (1981)): Filters 
containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI {pH 7.5). 6 mM EDTA, 0.1% PVP, 0.1% 
Ficoll, 1 % BSA, and 500 ;/g/ml denatured salmon sperm DNA (10X SSC is 1 .5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 

20 Hybridizations are carried out in the same solution with the following 

modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, lOO^g/ml salmon sperm 
DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10® cpm "P-labeled probe is 
used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, 
and then washed for 1.5 hours at 55 °C in a solution containing 2X SSC, 25 mM 

25 Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash solution is replaced 
with fresh solution and incubated an additional 1 .5 hours at 60°C. Filters are 
blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68**C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art (e.g., as employed for 

30 cross-species hybridizations). 

Procedures using conditions of moderate stringency include, for example, 
but are not limited to, procedures using such conditions of moderate stringency. 



wo 03/004681 



PCT/US02/21208 



-50- 

are as follows: Filters containing DNA are pretreated for 6 hours at 55*'C in a 
solution containing 6X SSC, 5X Denhart's solution, 0.5% SDS and TOO //g/ml 
denatured salmon sperm DNA. Hybridizations are carried out in the same 
solution and 5-20 X 10^ cpm ^^P-labeled probe is used. Filters are incubated in 
5 hybridization mixture for 18-20 hours at 55 °C, and then washed twice for 30 
minutes at 60°C in a solution containing 1X SSC and 0.1 % SDS, Filters are 
blotted dry and exposed for autoradiography. Other conditions of moderate 
stringency which can be used are well-known in the art. Washing of filters is 
done at 37**C for 1 hour in a solution containing 2X SSC, 0.1% SDS. 

10 By way of example and not way of limitation, procedures using conditions 

of high stringency are as follows: Prehybridization of filters containing DNA is 
carried out for 8 hours to overnight at 65**C in buffer composed of 6X SSC, 
50 mM Tris-HCI (pH 7.5), 1 mM EDTA. 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 
and 500 //g/ml denatured salmon sperm DNA. Filters are hybridized for 48 hours 

15 at 65 ^C in prehybridization mixture containing ^00 fjgfml denatured salmon 

sperm DNA and 5-20 X 10^ cpm of ^^P-labelad probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, 
and 0-01% BSA. This is followed by a wash in 0.1 X SSC at 50 °C for 45 
minutes before autoradiography. Other conditions of high stringency which can 

20 be used are well known in the art. 

The term substantially identical or substantially homologous or similar 
varies with the context as understood by those skilled in the relevant art and 
generally means at least 60% or 70%, preferably means at least 80%, 85% or 
more preferably at least 90%, and most preferably at least 95% identity. 

25 As used herein, substantially identical to a product means sufficiently 

similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
appear free of readily detectable impurities as determined by standard methods 

30 of analysis, such as thin layer chromatography (TLC), gel electrophoresis and 

high performance liquid chromatography (HPLC), used by those of skill in the art 
to assess such purity, or sufficiently pure such that further purification would 
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not detectably alter the physical and chemical properties, such as enzymatic and 
biological activities, of the substance. Methods for purilication of the 
compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound can, 
5 however, be a mixture of stereoisomers or Isomers. In such instances, further 
purification might increase the specific activity of the compound. 

As used herein, target cell refers to a cell that expresses an MTSP in vivo. 

As used herein, test substance (or test compound) refers to a chemically 
defined compound {e.g., organic molecules, inorganic molecules, 
10 organic/inorganic molecules, proteins, peptides, nucleic acids, oligonucleotides, 
lipids, polysaccharides, saccharides, or hybrids among these molecules such as 
glycoproteins, etc.) or mixtures of compounds {e.g., a library of test compounds, 
natural extracts or culture supernatants, etc.) whose effect on an MPSP, 
particularly a single chain form that includes a protease domain or a sufficient 
1 5 portion thereof for activity, as determined by an in vitro method, such as the 
assays provided herein, is tested. 

As used herein, the terms a therapeutic agent, therapeutic regimen, 
radioprotectant or chemotherapeutic mean conventional drugs and drug 
therapies, including vaccines, which are known to those skilled in the art. 
20 Radiotherapeutic agents are well known in the art. 

As used herein, treatment means any manner in which the symptoms of a 
condition, disorder or disease are ameliorated or otherwise beneficially altered. 
Treatment also encompasses any pharmaceutical use of the compositions herein. 

As used herein, vector (or plasmid) refers to discrete elements that are 
25 used to introduce heterologous nucleic acid into cells for either expression or 
replication thereof. The vectors typically remain episomal, but can be designed 
to effect integration of a gene or portion thereof into a chromosome of the 
genome. Also contemplated are vectors that are artificial chromosomes, such as 
yeast artificial chromosomes and mammalian artificial chromosomes. Selection 
30 and use of such vehicles are well known to those of skill in the art. An 

expression vector includes vectors capable of expressing DNA that is operatively 
linked with regulatory sequences, such as promoter regions, that are capable of 
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effecting expression of such DNA fragments. Thus, an expression vector refers 
to a recombinant DNA or RNA construct, such as a plasmid, a phage, 
recombinant virus or other vector that, upon introduction into an appropriate 
host cell, results in expression of the cloned DNA. Appropriate expression 
5 vectors are well known to those of skill in the art and include those that are 
replicable in eukaryotic cells and/or prokaryotic cells and those that remain 
episomal or those which integrate into the host cell genome. 

As used herein, protein binding sequence refers to a protein or peptide 
sequence or a portion of other macromolecules that is capable of specific binding 
10 to protein or peptide sequences generally, to a set of protein or peptide 
sequences or to a particular protein or peptide sequence. 

As used herein, epitope tag refers to a short stretch of amino acid 
residues corresponding to an epitope to facilitate subsequent biochemical and 
immunological analysis of the epitope tagged protein or peptide. Epitope tagging 
15 is achieved by including the sequence of the epitope tag to the protein-encoding 
sequence in an appropriate expression vector. Epitope tagged proteins can be 
affinity purified using highly specific antibodies raised against the tags. 

As used herein, metal binding sequence refers to a protein or peptide 
sequence that is capable of specific binding to metal ions generally, to a set of 
20 metal ions or to a particular metal ion. 

As used herein, a cellular extract refers to a preparation or fraction which 
is made from a lysed or disrupted cell. A tissue extract is a preparation or 
fraction which is made from a lysed or disrupted tissue. 

As used herein, a combination refers to any association between two or 
25 among more items. 

As used herein, a composition refers to any mixture. It can be a solution, 
a suspension, liquid, powder, a paste, aqueous, non-aqueous or any combination 
thereof. 

As used herein, fluid refers to any composition that can flow. Fluids thus 
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encompass compositions that are in the form of semi-solids, pastes, solutions, 
aqueous mixtures, gels, lotions, creams and other such compositions. 

As used herein, an agent is said to be randomly selected when the agent 
is chosen randomly without considering the specific sequences involved in the 
5 association of a protein alone or with its associated substrates, binding partners, 
etc. An example of randomly selected agents is the use a chemical library or a 
peptide combinatorial library, or a growth broth of an organism or conditioned 
medium. 

As used herein, an agent is said to be rationally selected or designed 
10 when the agent is chosen on a non-random basis which takes into account the 
sequence of the target site and/or its conformation in connection with the 
agent's action. Agents can be rationally selected or rationally designed by 
employing the peptide sequences that make up these sites. For example, a 
rationally selected peptide agent can be a peptide whose amino acid sequence is 
15 identical to the ATP or calmodulin binding sites or domains. 

For clarity of disclosure, and not by way of limitation, the detailed 
description is divided into the subsections that follow. 
B. MTSP20 polypeptides, muteins, derivatives and analogs thereof 
1. MTSPs 

20 The MTSPs are a family of transmembrane serine proteases that are 

found in mammals and also other species. MTSPs are of interest because they 
appear to be expressed and/or activated at levels different from those in tumor 
cells normal cells, or have functional activity that is different in tumor cells from 
normal cells, such as by an alteration in a substrate therefor, or a cof actor or a 

25 receptor. 

The MTSPs share a number of common structural features including; a 
proteolytic extracellular domain; a transmembrane domain, with a hydrophobic 
domain near the N-terminus; a short cytoplasmic domain; and a variable length 
stem region that may contain additional modular domains. The proteolytic 
30 domains share sequence homology including conserved His, Asp, and Ser 

residues necessary for catalytic activity that are present in conserved motifs. 
The MTSPs are normally synthesized as zymogens and can be activated to multt- 



wo 03/004681 



PCT/US02/21208 



-54- 

chain forms by cleavage. A single chain proteolytic domain can function in vitro 
and, hence is useful in in vitro assays for identifying agents that modulate an 
activity of members of this family. 

For purposes herein, a protease domain of an MTSP20 does not have to 
5 result from activation cleavage, which produces a multi-chain activated product, 
but rather Includes single chain polypeptides where the N-termini include the 
consensus sequence *VVGG, ilVGG, IVGLL, 4ILGG, *IVQG or IIVNG 1 lASG 
or other such motif. Such polypeptides, although not the result of activation 
cleavage and not multi-chain forms, exhibit proteolytic (catalytic) activity. These 

10 protease domain polypeptides are used in assays to screen for agents that 
modulate an activity of the MTSP20. 

The MTSP family Is a target for therapeutic intervention and also some 
members can serve as diagnostic markers for tumor development, growth and/or 
progression. As discussed, the members of this family are involved in 

1 5 proteolytic processes that are implicated in tumor development, growth and/or 
progression. This implication is based upon their functions as proteolytic 
enzymes in processes related to ECM degradation and/or remodeling and 
activation of pro-growth factors, pro-hormones or pro-angiogenic compounds. In 
addition, their levels of expression or level of activation or their apparent activity 

20 resulting from substrate levels or alterations In substrates and levels thereof 

differs in tumor cells and non-tumor cells in the same tissue. Similarly the level 
of co-factors or receptors for these proteases can vary between tumor and non- 
tumor cells. Hence, protocols and treatments that alter their activity, such as 
their proteolytic activities and roles in signal transduction, and/or their 

25 expression, such as by contacting them with a compound that modulates their 
activity and/or expression, could impact tumor development, growth and/or 
progression. Also, in some instances, the level of activation and/or expression 
can be altered in tumors, such as lung carcinoma, colon adenocarcinoma and 
ovarian carcinoma. 
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2. Protease involvement in angiogenesis 

In the initial stages of angiogenesis, microvascular endothelial cells oi 
preexisting blood vessels locally degrade the underlving basal lamina and invade 
Into the stroma of the tissue to be vascularized. It has been shown that this 
5 process requires a wide array of degradative enzymes (Mignatti and Rifkin, 
Enzyme Protein, 49(1'3) :1 17-37 (1996)). Components of the plasminogen 
activator (PA)-plasmin system and the matrix metalloproteinase (MMP) family 
play important roles. PAs trigger a proteinase cascade that results in the 
generation of high local concentrations of piasmin and active MMPs. This 

1 0 increase in proteolytic activity has three major consequences: it permits 
extracellular matrix degradation and invasion of the vessel basal lamina, 
generates extracellular matrix (ECM) degradation products that are chemotactic 
for endothelial cells, and activates and mobilizes growth factors localized in the 
ECM. Five proteases, urokinase-type plasminogen activator, factor XII, protein 

1 5 C, trypsinogen IV, and a membrane-type serine protease 1 (MTSP1 ) that may be 
involved in these processes have been identified. 

At least nine different MMPs have been identified. Among these are 
gelatinase A, which degrades collagen types IV and V, elastin and laminin and 
which is frequently overexpressed in stromal cells of malignant tumors (Vassalli 

20 and Pepper, Nature, 370:14-5 (1994)). Nucleic acid encoding a matrix 

metalloproteinase with a potential transmembrane domain has been cloned (Sato 
et al.. Nature, 370:61-5 (1994)). Expression of the cloned gene product on the 
cell surface induces specific activation of pro-gelatinase A in vitro and enhances 
cellular invasion of the reconstituted basement membrane. Tumor cells of 

25 invasive lung carcinomas, which contain activated forms of gelatinase A, were 
found to express the transcript and the gene product. Inhibition of protease 
activity through the use of wild-type and engineered ecotins results in inhibition 
of rat prostate differentiation and retardation of the growth of human PC-3 
prostatic cancer tumors (see, Takeuchi et al. Proc. NatL Acad. Sci, (USA), 

30 96(20) :1 1054-61 (1999)). 
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Thus, endothelial cells and proteases play key roles in angiogenesis and 
related processes. Furthermore, proteases can serve as therapeutic targets and 
points of intervention in processes relying on protease activities. 

As noted above, aberrant angiogenesis and processes related thereto 
5 have a role in a variety of disorders, including cancers, diabetic retinopathies, 
hyperproliferative disorders, restenosis, and others. As provided herein, because 
proteases are involved either directly or indirectly in angiogenesis, which is 
aberrant in a variety of disorders, altering an activity of proteases involved in 
angiogenesis could treat the resulting disorders. Thus, proteases involved in 
10 angiogenesis can serve as therapeutic targets and also in drug screening 
methodologies to identify compounds that modulate, particularly inhibit, 
angiogenesis. 

MTSP20, provided herein is an endotheiiase. It is an endothelial cell 
transmembrane proteins, which in view of the roleU) of endothelial cells and 

1 5 proteases in angiogenesis and related processes, is directly or indirectly involved 
in the process of angiogenesis or processes related thereto. Thus, MTSP20s are 
therapeutic, diagnostic and/or prognostic targets for intervention in the process 
of angiogenesis, either for inhibition or activation thereof. In addition, protease 
domains (extracellular domains) provide a means to screen for compounds that 

20 modulate angiogenesis. 

The IVITSP20S provided herein and the screening methods provided herein 
permit discovery of candidate compounds that modulate processes involved in 
the establishment and maintenance of the vasculature. These methods provide a 
means to select compounds that selectively bind to the MTSP20s or interact 

25 therewith resulting an increase or decrease in a protease activity of the MTSP20. 
Since MTSP20s occur on endothelial cells, which are intimately involved in 
processes related to establishment and maintenance of the vasculature, the 
compounds identified by the methods herein can have activity as anti-angiogenic 
or pro-angiogenic agents. 
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3. MTSP20 

Thus, MTSP20 is of interest because it is expressed or is active in tumor 
cells. In addition, as noted MTSP20 is also an endotheliase, and is expressed or 
is active in endothelial cells, particularly those involved in angiogenesis 
5 (angiogenic endothelial cellsK The MTSPs provided herein can serve as a 
diagnostic marker for particular tumors, by virtue of a level of activity and/or 
expression or function in a subject (i.e. a mammal, particularly a human) with 
neoplastic disease, compared to a subject or subjects that do not have the 
neoplastic disease. In addition, detection of activity (and/or expression) in a 

10 particular tissue can be indicative of neoplastic disease. 

It is shown herein, that MTSP20s provided herein are expressed and/or 
activated In certain tumors; hence their activation or expression can serve as a 
diagnostic marker for tumor development, growth and/or progression- In other 
instances, the MTSP polypeptide can exhibit altered activity by virtue of a 

1 5 change in activity or expression of a co-factor, a substrate or a receptor. In 
addition, in some instances, these MTSPs and/or variants thereof can be shed 
from cell surfaces. Detection of the shed MTSPs, particularly the extracellular 
protease domains, in body fluids, such as serum, blood, saliva, cerebral spinal 
fluid, synovial fluid and interstitial fluids, urine, sweat and other such fluids and 

20 secretions, can serve as a diagnostic tumor marker. In particular, detection of 
higher levels of such shed polypeptides in a subject compared to a subject 
known not to have any neoplastic disease or compared to earlier samples from 
the same subject, can be indicative of neoplastic disease in the subject, 
a. Polypeptides and muteins 

25 Provided herein are isolated substantially pure single chain and multi-chain 

polypeptides that contain a protease domain of an MTSP20. The polypeptides 
also can include other non-MTSP sequences of amino acids, but includes a 
protease domain or a sufficient portion thereof to exhibit catalytic activity in any 
in vitro assay that assess such protease activity, such as any provided herein. 

30 MTSP20 polypeptides provided herein are expressed or activated by or in 

tumor celts or endothelial, typically at a level that differs from the level in which 
they are expressed by or activated in a non-tumor cell of the same type. Hence, 
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for example, if the MTSP is expressed in an cervical tumor cell, it is expressed or 
active at a different level from the level in non-tumor cervical cells. MTSP20 
expression or activation can be indicative of cervical, lung, esophogeal, colon, 
prostate, uterine, pancreatic, breast and other tumors. 
5 Isolated, substantially pure proteases that include protease domains or a 

catalyttcally active portion thereof are provided. Provided are single chain forms 
and multi-chain forms of the MTSP20. Protease domains can be included in a 
longer protein, and such longer protein is optionally a MTSP20 zymogen up to a 
full-length zymogen. Exemplary MTSP20-encoding nucleic acid and protein 

10 sequences of a protease domain are set forth in SEQ ID Nos. 5 and 6. Full- 
length MTSP20-encoding nucleic acid molecules that contain the sequence set 
forth in SEQ ID No. 5 (with amino acid 247 optionally replaced with I) or SEQ ID 
No. 16 and polypeptides that include the sequence of amino acids set forth in 
SEQ ID No. 6 (with amino acid 247 optionally replaced with I) or SEQ ID No. 16 

15 or catalytically active portions thereof are also provided herein. Thus, an 

MTSP20 polypeptide includes the sequence of amino acids set forth in SEQ ID 
No. 6 or residues 624-642 of SEQ ID No. 16 are provided. Smaller portions 
thereof that retain protease activity are contemplated. 

Substantially purified MTSP20 protease is encoded by a nucleic acid that 

20 hybridizes to a nucleic acid molecule that encodes a protease domain (or two) 
within the sequence of amino acids set forth in SEQ. ID No. 16 under at least 
moderate, generally high, stringency conditions, such that a protease domain 
encoding nucleic acid thereof hybridizes along Its full-length or at least 70%, 
80% or 90% of the full-length. In certain embodiments the substantially 

25 purified MTSP protease is a single chain polypeptide that includes substantially 
the sequence of amino acids set forth in SEQ ID No. 6 or a catalytically active 
portion thereof. 

Also included are substantially purified MTSP20 zymogens, activated 
two, three and other multi-chain forms, single chain protease domains and two 
30 chain protease domains. These polypeptides are encoded by nucleic acids that 
include sequences encoding a protease domain that exhibits proteolytic activity 
and that hybridizes to a nucleic acid molecule having a nucleotide sequence set 
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forth in SEQ ID No. 5 or SEQ ID No. 16, typically under moderate, generally 
under high stringency, conditions and generally along the full-length or along at 
least about 70%, 80% or 90% of the full-length (or substantially the full-length) 
of a protease domain. Splice variants are also contemplated herein. 
5 b. Structural features 

MTSP20 has two trypsin-like serine protease domains in tandem, 
hereafter referred to as MTSP20-PD1 (aa 113 to aa 343) and MTSP20-PD2 (aa 
375 to aa 596). Both protease domains are characterized by the presence of a 
protease activation cleavage site at the beginning of the domain and the catalytic 
10 triad residues (histidine, aspartate and serine) in 3 highly-conserved regions of 
the catalytic domain. 

MTSP20-PD1 has the following features: a protease cleavage site (.-.Kt^^ 
* PiiaQEGN..., where * indicates the cleavage site); catalytic triad residues 

D203 srid S299); the following cysteine pairings within the protease domain 
15 {C^37-Cis3, C233-C305, C264-C2„ and 0295-0824); an unpaired cysteine (Czig) in the 
protease domain is predicted to pair with 0,03. 

MTSP20-PD2 has the following features: the protease cleavage site 
(...R375 * T376AGPQ...); catalytic triad residues (H4,6, D457 and S553); the following 
cysteine pairings within the protease domain (040,-0417, 05,9-0539 and 0549-0577); 
20 the unpaired cysteine (0477) in the protease domain is predicted to pair with C371. 



c. Protease domains 

MTSP protease domains include the single chain protease domains and 
multi-chain domains of MTSP20. In the exemplary MTSP20, there are two 

25 protease domains: residues 113-343 (PD1) of SEQ ID No. 16 (see, also SEQ ID 
No. 6) and residues 376-642 (PD2) of SEQ ID No, 16. Also contemplated are 
catalytically active portions thereof, forms that include both domains and multi- 
chain forms thereof. Each of these can be a single chain form or in a two chain 
form formed by cleavage of the activation cleavage site after binding of an 

30 unpaired Oys in the protease domain (/.e. CySj^g in PD1 and Cys477 in PD2) 

paired with a Cys outside the domain 0,03 for PD1 and 037, for PD2 to produce a 
two chain or multi-chain polypeptide that includes the protease domain plus at 
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least the residues that are between the linked cysteines prior to cleavage and 
after cleavage. 

A protease domain or a plurality thereof can be employed as substantially 
isolated proteins, or as part of a full-length MTSP20, or can be a part of a larger 
5 polypeptide that includes other non-MTSP sequences of annino acids, but 
includes a protease domain or a plurality of domains or a sufficient portion 
thereof to exhibit catalytic activity in any in vitro assay that assess such 
protease activity, such as any provided herein. Also provided are multi-chain 
activated forms of the full length MTSP20 protease and also multi-chain forms of 

10 the protease domains. Thus, isolated, substantially pure proteases that include 
the protease domains (one or two protease domains) or catalyttcally active 
portions thereof as single chain forms of SPs are provided. The protease 
domains can be included in a longer protein, and such longer protein Is optionally 
the activated MTSP20 protein, up to and including a full-length, or an MTSP20 

1 5 zymogen. 

In particular, exemplary protease domains include at least a sufficient 
portion of sequences of amino acids set forth of SEQ ID No. 6 or SEQ ID No. 16 
(encoded by nucleotides in SEQ ID No. 5 or SEQ ID No. 15) to retain catalytic 
activity in vitro. 

20 As noted, protease domains of an MTSP are single-chain polypeptides or 

multi-chain polypeptides, with an N-terminus (such as IV, VV, IL and II or other 
N-terminus) generated at the cleavage site (generally having the consensus 
sequence RIVVGG, RIIVGG, R4IVQ, RilVNG, RilLGG, RIVGLL, RilLGG or a 
variation thereof or a site at an homologous locus; an N-terminus R4 V or Ril, 

25 where the arrow represents the cleavage point) when a zymogen is activated. A 
protease domain of an exemplary l\/ITSP20 is produced by activation cleavage 
between reside 112 and 113 and includes the sequence Knji PujQEGNI 6 (KiP) 
as set forth in SEQ ID No. 16. A second activation cleavage site occurs 
between residues 375 and 376 and includes the Ra^giTjygAGPQ of sequence ID 

30 No. 16. The resulting protease domains span resudes 1 1 3 to 343 of SEQ ID No. 
16 and residues 376-642 of SEQ ID No. 16; see also SEQ ID No. 6, where 
residue 247 (I522 of SEQ ID No. 16) is M. A single chain form includes the 
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sequence of amino acids set forth in SEQ ID No. 6 or as residues 1-343 or 113- 
343 of SEQ ID No. 16 and catalytically active fragments thereof are provided. 
Thus provided are full-length MTSPs, which include at least a residue 624 (as set 
forth in SEQ ID No. 16), particularly residues 624 and one or more contiguous 
5 residues up to 642 (of SEQ ID No. 1 6 or equivalent amino acids) are provided 
{i.e., a splice variant that includes such residue(s)). 

Also provided are single chain protease domains that include residues 1 - 
343 or 1 13-343 of SEQ ID No. 16 or SEQ ID No. 6 and/or catalytically active 
fragments thereof or variants thereof (as described herein), are contemplated 
10 herein. Multiple chain, particularly two and three chain forms, are provided. 

Also provided are polypeptides that are MTSP20 polypeptides as described that 
have 60%, 70%, 80%, 90%, or 95% sequence identity there with and/or is: 

a) a polypeptide that includes the sequence of amino acids set 
forth in SEQ ID No. 6, where residue 247 is M or 1, as the only MTSP20 

1 5 sequence in the polypeptide; 

b) a polypeptide that includes the sequence of amino acids set 
forth in SEQ ID No. 16 or a catalytically active portion thereof; 

c) a polypeptide that includes residue 624 or a contiguous 
sequence of amino acids that includes residue 624 and one or more contingious 

20 residues through residue 642 as set forth in SEQ ID No. 1 6; 

d) a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 5 or to SEQ ID No. 1 5 or hybridizes along 
at least 70%, 80%, 90% or 95% of its full-length that encodes any of the 

25 polypeptides of a), b) or c) 

e) a polypeptide that includes the sequence of amino acids set 
forth in SEQ ID No. 6 or SEQ ID No. 1 6 or a catalytically active portion thereof; 

f) a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81%. 82%, 83%, 84%, 85%, 86%, 87%, 

30 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 

sequence identity with the sequence of amino acids set forth in SEQ ID No. 6 or 
SEQ ID No. 16; and/or 
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g) a protease domain of a polypeptide encoded by a splice variant 
of a sequence of nucleotides that encodes an MTSP20 of any of a)-f); and . 

h) for use in the screening and diagnostic methods MTSP20 
polypeptides also include those that include the sequence of amino acids set 

5 forth in SEQ ID No. 17 or a protease domain thereof and those that are encoded 
by a sequence of nucleotides that hybridizes under conditions of low, moderate 
or high stringency to the sequence of nucleotides set forth in SEQ ID No; 17 
along at least 70%, 80%, 90%, 95% or 100% of Its full-length to the full-length 
polypeptide or protease domain thereof. 

10 Also provided are polypeptides that are encoded by nucleic acid 

molecules that meet criteria specified below as (a)-(f) in the section designated 
"Nucleic Acids." 

d. Muteins and derivatives 
Full-length MTSP20, zymogen and activated forms thereof and MTSP20 

1 5 protease domains, portions thereof, and muteins and derivatives of such 

polypeptides are provided. The domains, fragments, derivatives or analogs of an 
MTSP20 that are functionally active are capable of exhibiting one or more 
functional activities associated with the MTSP20 polypeptide, such as serine 
protease activity, immunogenicity and antigenicity, are provided. 

20 Among the derivatives are those based on animal MTSP20s, including, 

but are not limited to, rodent, such as mouse and rat; fowl, such as chicken; 
ruminants, such as goats, cows, deer, and sheep; ovine, such as pigs; and 
humans. For example, MTSP20 derivatives can be made by altering their 
sequences by substitutions, additions or deletions. MTSP20 derivatives include, 

25 but are not limited to, those containing, as a primary amino acid sequence, all or 
part of the amino acid sequence of MTSP20, including altered sequences in 
which functionally equivalent amino acid residues are substituted for residues 
within the sequence resulting in a silent change. For example, one or more 
amino acid residues within the sequence can be substituted by another amino 

30 acid of a similar polarity which acts as a functional equivalent. Substitutes for 
an amino acid within the sequence can be selected from other members of the 
class to which the amino acid belongs. For example, the nonpolar (hydrophobic) 
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amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan and methionine. The polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The positively 
charged Ibasic) amino acids include arginine, lysine and histidine. The negatively 
5 charged (acidic) amino acids include aspartic acid and glutamic acid {see, e.g., 
Table 1). Muteins of the MTSP20 or a domain thereof, such as a protease 
domain, in which up to about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 
85%, 90% or 95% of the amino acids are replaced with another amino acid are 
provided. Generally such muteins retain at least about 1%, 2%, 3%, 5%, 7%, 

10 8%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 90% of a protease 
activity of the unmutated protein. Those of skill in the art recognize that a 
polypeptide that retains at least 1 % of an activity of the wild-type protease is 
sufficiently active for use in screening assays or for other applications. 

Included among the polypeptides provided herein are the MTSP20 

15 protease domain or a polypeptide with amino acid changes such that the 

specificity and protease activity remains substantially unchanged or changed 
(increased or decreased) by a specified percentage, such as 10%, 20%, 30%, 
40%, 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%. 98%, 99%, 99.5%. In particular, a substantially purified mammalian 

20 MTSP polypeptide is provided that has a transmembrane (TMI domain two 
protease catalytic domains. 

Also provided is a substantially purified protein containing a sequence of 
amino acids that has at least 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 
85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%. 94%, 95%, 96%, 97%. 

25 98% or 99% identity to the MTSP20 where the percentage identity is 

determined using standard algorithms and gap penalties that maximize the 
percentage identity. The human l\/ITSP20 polypeptide is included, although other 
mammalian MTSP20 polypeptides are contemplated. The precise percentage of 
identity can be specified if needed. 

30 Muteins in which one or more of the Cys residues, particularly, a residue 

that is paired in the activated two form, but unpaired in a protease domain alone 
is/are replaced with any amino acid, typically, although not necessarily, a 
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conservative amino acid residue, such as Ser, are coritemplated. Muteins of 
MTSP20, particularly those in which Cys residues, such as the Cys^^g and Cys477 
in the single chain protease domains, is replaced with another annino acid, such 
as Ser, Gly or Ala, that does not eliminate the activity, are provided. 
5 Also provided are substantially purified MTSP20 polypeptides and 

functional domains thereof, including catalytically active domains and portions, 
that have at least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 
86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% 
or 99% sequence identity with a protease domain that includes a sequence of 
10 amino acids set forth in SEQ ID No. 6 or catalytically active fragments thereof. 
Muteins of the protein are also provided in which amino acids are 
replaced with other amino acids. Among the muteins are those in which the Cys 
residues, is/are replaced typically with a conservative amino acid residues, such 
as a serine. Such muteins are also provided herein. Muteins are provided in 
15 which 10%, 20%, 30%, 35%, 40%, 45%, 50% or more of the amino acids are 
replaced but the resulting polypeptide retains at least about 1%, 2%, 3,%, 5%, 
7%, 8%, 10%, 20%, 30%, 35%, 40%, 45%, 50%, 60%, 70%, 80%, 90% or 
95% of the catalytic activity as the unmodified form for the same substrate. 
Muteins can be made by making conservative amino acid substitutions 
20 and also non-conservative amino acid substitutions. For example, amino acid 
substitutions that desirably alter properties of the proteins can be made. In one 
embodiment, mutations that prevent degradation of the polypeptide can be 
made. Many proteases cleave after basic residues, such as R and K: to eliminate 
such cleavage, the basic residue is replaced with a non-basic residue. Also, non- 
25 conservative changes at amino acids outside of a protease domain can be 

effected without altering protease activity. Non-conservative changes at amino 
acids that are responsible for activities other than protease activity may be 
desirable. For example, interaction of a protease with an inhibitor can be 
blocked while retaining catalytic activity by effecting a non-conservative change 
30 at the site of interaction of the inhibitor with the protease. Similarly, receptor 
binding can be altered without altering catalytic activity by effecting a non- 
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conservative or conservative change at a site of interaction of the receptor with 
the protease. 

Antigenic epitopes that contain at least 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 

14, 15, 20, 25, 30, 40. 50, and typically 10-15 amino acids of the MTSP20 

5 polypeptide are provided. These antigenic epitopes are used, for example, to 

raise antibodies. Antibodies specific for each epitope or combinations thereof 

and for single-, two- and other multi-chain forms are also provided. Antibodies 

that bind with higher affinity to an MTSP that includes amino acid 624-642 (or 

includes two protease domains) than to an MTSP20, such as the polypeptide of 

10 SEQ ID No. 17, that lacks such residues are provided- 

4. Nucleic acid molecules, vectors and plasmids, cells and expression 
of MTSP20 polypeptides 

a. Nucleic acid molecules 

Due to the degeneracy of nucleotide coding sequences, other nucleic 
1 5 sequences which encode substantially the same amino acid sequence as a MTSP 
are contemplated. These include but are not limited to nucleic acid molecules 
that include all or portions of MTSP20-encoding genes that are altered by the 
substitution of different codons that encode the amino acid residue within the 
sequence, thus producing a silent change. 
20 1 . Nucleic acids 

Also provided herein are nucleic acid molecules that encode MTSP20 
polypeptides and the encoded proteins. In particular, nucleic acid molecules 
encoding MTSP20 from animals, including splice variants thereof are provided. 
The encoded proteins are also provided. Also provided are functional domains 
25 thereof. For each of the nucleic acid molecules provided, the nucleic acid can be 
DMA or RNA or PNA or other nucleic acid analogs or can include non-natural 
nucleotide bases. Also provided are isolated nucleic acid molecules that include 
a sequence of nucleotides complementary to the nucleotide sequence encoding 
an MTSP. 

30 Also provided are nucleic acid molecules that encode single chain or 

multi-chain MTSP proteases that have proteolytic activity in an in vitro 
proteolysis assay and that have at least 60%, 70%, 75%, 80%, 81%, 82%, 
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83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 
96%, 97%, 98% or 99% sequence identity with the futl-length of a protease 
domain of an MTSP20 polypeptide, or that hybridize along their full-length or 
along at least about 70%, 80% or 90% of the full-length nucleic acid to a 
5 nucleic acids that encode a protease domain, particularly under conditions of 
moderate, generally high, stringency. As above, the encoded polypeptides 
contain the protease as a single chain; activated forms thereof can be produced 
and are provided. 

In one embodiment, a nucleic acid molecule that encodes an MTSP, 

10 designated IVIT5P20 is provided. The nucleic acid molecule includes the open 
reading frame in the sequence of nucleotides set forth in SEQ ID No. B or SEQ (D 
No. 16. Also provided are nucleic acid molecules that hybridize under conditions 
of at least low stringency, moderate stringency, and generally high stringency to 
the following sequence of nucleic acids (SEQ ID No. 5) particularly to the open 

15 reading frame encompassed by nucleotides that encode a single protease domain 
thereof, or any domain of MTSP20. 

In certain embodiments, the isolated nucleic acid fragment hybridizes to 
the nucleic acid having the nucleotide sequence set forth in SEQ ID No. 5 under 
high stringency conditions, and generally contains the sequence of nucleotides 

20 set forth as nucleotides 1-690 in SEQ ID No. 5. The protein contains a 

transmembrane domain (TM) and serine protease domains and can contain 
additional domains. 

Also provided, are muteins of the nucleic acid molecules that encode 
polypeptides in which amino acids are replaced with other amino acids. Among 

25 the muteins are those in which the Cys residue-encoding codons, is/are replaced 
with other amino acid residues, such as a codon encoding a serine. Such 
muteins are also provided herein. Each of such domains is provided herein as 
are nucleic acid molecules that include sequences of nucleotides encoding such 
domains. Some MTSPs can additionally include a LDLR domain, a scavenger- 

30 receptor cysteine rich (SRCR) domain, CUB domain and other domains. 

The isolated nucleic acid fragment is DNA, including genomic or cDNA, or 
is RNA, or can include other components, such as peptide nucleic acid (PNA) 
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and other nucleotide analogs. The isolated nucleic acid can include additional 
components, such as heterologous or native promoters, and other transcriptional 
and translational regulatory sequences, these genes can be linked to other 
genes, such as reporter genes or other indicator genes or genes that encode 
5 Indicators. 

Also provided are nucleic acid molecules that hybridize to the above- 
noted sequences of nucleotides encoding MTSP20 at least at low stringency, 
moderate stringency, and typically at high stringency, and that encode a 
protease domain and/or a plurality of protease domains and/or the full-length 
10 protein or at least 60%, 70%, 80% or 90% of the full-length protease domain or 
other domains of an MTSP20 or a splice variant or allelic variant thereof. 
Generally the molecules hybridize under such conditions along their full-length or 
along at least 70%, 80% or 90% of the full-length for at least one domain and 
encode at least one domain, such as a protease domain or an extracellular 
1 5 domain, of the polypeptide. In particular, such nucleic acid molecules include 
any isolated nucleic acid fragment that encodes at least one domain of a 
membrane serine protease, that (1) contains a sequence of nucleotides that 
encodes the protease or a domain thereof, and (2) is selected from among: 

{a) a sequence of nucleotides that encodes the MTSP20 polypeptide 
20 or a domain thereof that includes a sequence of nucleotides set 

forth in SEQ ID No. 5, particularly nucleotides 1-690, or SEQ ID 
No, 15; 

(b) a sequence of nucleotides that encodes such portion or the full- 
length protease and hybridizes under conditions of high stringency, 

25 generally to nucleic acid that is complementary to a mRNA 

transcript present in a mammalian cell that encodes such protein 
or fragment thereof; 

(c) a sequence of nucleotides that encodes a protease or a domain 
thereof that includes a sequence of nucleotides having at least 

30 about 60%, 70%, 80%, 90% or 95% sequence identity with the 

sequence set forth in SEQ ID No. 5, particularly nucleotides 360- 
1 052 or 1 1 48-1 952 in SEQ ID No. 15; 
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a sequence of nucleotides that encodes a transnnembrane protease 
or domain thereof that includes a sequence of amino acids 
encoded by such portion or the full-length open reading frame; 
a sequence of nucleotides that encodes the MTSP20 polypeptide 
or a domain thereof that includes a sequence of nucleotides set 
having at least about 60%, 70%, 80%, 90% or 95% sequence 
identity with the sequence set forth in SEQ ID No. 5 or SEQ ID No. 
15; and 

a sequence of nucleotides that encodes the MTSP20 polypeptide 

that includes a sequence of amino acids encoded by a sequence of 

nucleotides that encodes such subunit and hybridizes under 

conditions of low, moderate or high stringency to DNA that is 

complementary to the mRNA transcript. 

The isolated nucleic acids can contain least 10 nucleotides, 25 

15 nucleotides, 50 nucleotides, 100 nucleotides, 150 nucleotides, or 200 

nucleotides or more contiguous nucleotides of an MTSP20-encoding sequence, 

or a full-length SP coding sequence. In another embodiment, the nucleic acids 

are smaller than 35, 200 or 500 nucleotides in length. Nucleic acids that 

hybridize to or are complementary to an MTSP20-encoding nucleic acid molecule 

20 can be single or double-stranded. For example, nucleic acids are provided that 

include a sequence complementary to (specifically are the inverse complement 

of) at least 10, 25, 50, 100, or 200 nucleotides or the entire coding region of an 

MTSP20 encoding nucleic acid, particularly a protease domain thereof. For 

MTSP20 the full-length protein or a domain or domains or active f ragment{s) 

25 thereof is(are) also provided. 

2. Probes, primers, antisense oligonucleotides and 
dsRNA 

Also provided are fragments thereof that can be used as probes or 
primers and that contain at least about 10 nucleotides, 14 nucleotides, generally 
30 at least about 1 6 nucleotides, often at least about 30 nucleotides. The length of 
the probe or primer is a function of the size of the genome probed; the larger the 
genome, the longer the probe or primer required for specific hybridization to a 



(d) 
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single site. Those of skill in the art can select appropriately sized probes and 
primers. Generally probes and primers as described are single-stranded. Double 
stranded probes and primers can be used, if they are denatured when used. 

Probes and primers derived from the nucleic acid molecules are provided. 
5 Such probes and primers contain at least 8, 14, 16, 30, 100 or more contiguous 
nucleotides with identity to contiguous nucleotides of an MTSP20, and probes of 
at least 14, 16, 30, 50 or 100 contiguous sequence of nucleotides of SEQ ID 
No. 5 or SEQ ID No. 1 5. The probes and primers are optionally labelled with a 
detectable label, such as a radiolabel or a fluorescent tag, or can be mass 

1 0 differentiated for detection by mass spectrometry or other means. 

Also provided is an isolated nucleic acid molecule that includes the 
sequence of molecules that is complementary to the nucleotide sequence 
encoding MTSP20 or the portion thereof. Double-stranded RNA (dsRNA), such 
as RNAi is also provided. 

15 3. Plasmids, Vectors and Cells 

Plasmids and vectors containing the nucleic acid molecules are also 
provided. Cells containing the vectors, including cells that express the encoded 
proteins are provided. The cell can be a bacterial cell, a yeast cell, a fungal cell, 
a plant celt, an insect cell or an animal cell. Methods for producing an MTSP or 

20 single chain form (or multi-chain form) of a protease domain thereof by, for 
example, growing the cell under conditions whereby the encoded MTSP is 
expressed by the cell, and recovering the expressed protein, are provided herein. 
As noted, for MTSP20, the full-length zymogens and activated proteins and 
activated (two. three or multi-chain) protease and single chain protease domains 

25 are provided. As described herein, the cells are used for expression of the 
protein, which can be secreted or expressed in the cytoplasm. 

As discussed below, the MTSP20 polypeptide, and catalytically active 
portions thereof, can be expressed on the surface of a cell. In addition, all or 
portions thereof can be expressed as a secreted protein using the native signal 

30 sequence or a heterologous signal. Alternatively, all or portions of the 

polypeptide can be expressed as inclusion bodies in the cytoplasm and isolated 
therefrom. The resulting protein can be treated to refold if necessary. 
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The above discussion provides an overview and some details of the 
exemplified MTSP20s. 

C. Tumor specificity and tissue expression profiles 

MTSPs are of interest because they appear to be expressed and/or 
5 activated at different levels in tumor cells from normal cells, or have functional 
activity that is different in tumor cells from normal cells, such as by an alteration 
in a substrate for the MTSP, or a cofactor or receptor of the MTSP. MTSP20 is 
of interest because It is expressed or is active in tumor cells. Hence the MTSPs 
provided herein can serve as diagnostic markers for certain tumors. 

10 Each MTSP has a characteristic tissue expression profile; the MTSPs in 

particular, although not exclusively expressed or activated in tumors, exhibit 
characteristic tumor tissue expression or activation profiles. In some instances, 
MTSPs can have different activity in a tumor cell from a non-tumor cell by virtue 
of a change in a substrate or cofactor or receptor therefor or other factor that 

1 5 would alter the functional activity of the MTSP. Hence each can serve as a 
diagnostic marker for particular tumors, by virtue of a level of activity and/or 
expression or function in a subject (i.e. a mammal, particularly a human) with 
neoplastic disease, compared to a subject or subjects that do not have the 
neoplastic disease. In addition, detection of activity (and/or expression) in a 

20 particular tissue can be indicative of neoplastic disease. Shed MTSPs in body 
fluids can be indicative of neoplastic disease. Also, by virtue of an activity 
and/or expression profiles of each, they can serve as therapeutic targets, such 
as by administration of modulators of an activity thereof, or, as by administration 
of a prodrug specifically activated by one of the MTSPs. 

25 Tissue expression profiles 

MTSP20 

MTSP20 is expressed in many tissues. The MTSP20 transcript is found 
in liver, lymph node, cerebellum, pancreas, prostate, uterus, testis, glands 
(adrenal, thyroid and salivary), thymus, kidney and spleen. Lower transcript 
30 level is found in lung, placenta, bladder, ovary, digestive system, circulatory 

system and other parts of the brain. The MTSP20 transcript is found at a low 
level in a number of normal tissues including breast, prostate, cervix, uterus. 
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ovary, colon, lung, small intestine, stomach, kidney, pancreas, thyroid and 
rectum. MTSP20 is also expressed in certain tumor cell lines including lung 
carcinoma (A5191, colorectal carcinoma {SW480), lymphoma (Raji and Daudi), 
cervical carcinoma (HeLaS3) and leukemia (HL-60, K-562 and MOLT-4) cell lines. 
5 MTSP20 is expressed in endothelial cells; hence it is an endotheliase. 
D. Identification and isolation of MTSP20 polypeptide genes 

The MTSP polypeptides and/or domains thereof, can be obtained by 
methods well known in the art for protein purification and recombinant protein 
expression. Any method known to those of skill in the art for identification of 

10 nucleic acids that encode desired genes can be used. Any method available in 
the art can be used to obtain a full-length {i.e., encompassing the entire coding 
region) cDNA or genomic DNA clone encoding an MTSP polypeptide. For 
example, the polymerase chain reaction (PGR) can be used to amplify a 
sequence that is expressed in normal and tumor celts or tissues, e.g., nucleic 

1 5 acids encoding an MTSP20 polypeptide (SEQ. Nos: 5 and 1 6), in a genomic or 
cONA library. Oligonucleotide primers that hybridize to sequences at the 3' and 
5' termini of the identified sequences can be used as primers to amplify by PGR 
sequences from a nucleic acid sample IRNA or DNA), generally a cDNA library, 
from an appropriate source {e.g., tumor or cancer tissue). 

20 PGR can be carried out, e.g., by use of a Perkin-Elmer Cetus thermal 

cycler and Taq polymerase (Gene Amp"). The DNA being amplified can include 
mRNA or cDNA or genomic DNA from any eukaryotic species. One can choose 
to synthesize several different degenerate primers, for use in the PGR reactions. 
It is also possible to vary the stringency of hybridization conditions used in 

25 priming the PGR reactions, to amplify nucleic acid homologs {e.g., to obtain 
MTSP polypeptide sequences from species other than humans or to obtain 
human sequences with homology to MTSP20 polypeptide) by allowing for 
greater or lesser degrees of nucleotide sequence similarity between the known 
nucleotide sequence and the nucleic acid homolog being isolated. For cross- 

30 species hybridization, low stringency to moderate stringency conditions are 
used. For same species hybridization, moderately stringent to highly stringent 
conditions are used. The conditions can be empirically determined. 
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After successful amplification of the nucleic acid containing all or a 
portion of the identified MTSP polypeptide sequence or of a nucleic acid 
encoding all or a portion of an MTSP polypeptide honnolog, that segment can be 
molecularly cloned and sequenced, and used as a probe to isolate a complete 
5 cDNA or genomic clone. This, in turn, permits the determination of the gene's 
complete nucleotide sequence, the analysis of its expression, and the production 
of its protein product for functional analysis. Once the nucleotide sequence is 
determined, an open reading frame encoding the MTSP polypeptide gene protein 
product can be determined by any method well known in the art for determining 

10 open reading frames, for example, using publicly available computer programs 
for nucleotide sequence analysis. Once an open reading frame is defined, it is 
routine to determine the amino acid sequence of the protein encoded by the 
open reading frame. In this way, the nucleotide sequences of the entire MTSP 
polypeptide genes as well as the amino acid sequences of MTSP polypeptide 

15 proteins and analogs can be identified. 

Any eukaryotic cell potentially can serve as the nucleic acid source for 
the molecular cloning of the MTSP polypeptide gene. The nucleic acids can be 
isolated from vertebrate, mammalian, human, porcine, bovine, feline, avian, 
equine, canine, as well as additional primate sources, insects, plants and other 

20 organisms. The DNA can be obtained by standard procedures known in the art 
from cloned DNA {e.g., a DNA "library"), by chemical synthesis, by cDNA 
cloning, or by the cloning of genomic DNA, or fragments thereof, purified from 
the desired cell (see, e.g., Sambrook et a/. (2001) Molecular Cloning, A 
Laboratory Manual, 3d Ed., Cold Spring Harbor Laboratory Press, Cold Spring 

25 Harbor, New York; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical 
Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II). Clones derived from 
genomic DNA can contain regulatory and intron DNA regions in addition to 
coding regions; clones derived from cDNA will contain only exon sequences. For 
any source, the gene is cloned into a suitable vector for propagation thereof. 

30 In the molecular cloning of the gene from genomic DNA, DNA fragments 

are generated, some of which will encode the desired gene. 
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The DNA can be cleaved at specific sites using various restriction enzymes. 
Alternatively, one can use DNAse in the presence of manganese to fragment the 
DNA, or the DNA can be physically sheared, for example, by sonication. The 
linear DNA fragments then can be separated according to size by standard 
5 techniques, including but not limited to, agarose and polyacrylamide gel 
electrophoresis and column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired gene can be accomplished in a number of ways. 
For example, a portion of the MTSP polypeptide (of any species) gene (e.^., a 

10 PCR amplification product obtained as described above or an oligonucleotide 

having a sequence of a portion of the known nucleotide sequence) or its specific 
RNA, or a fragment thereof can be purified and labeled, and the generated DNA 
fragments can be screened by nucleic acid hybridization to the labeled probe 
(Benton and Davis, Science 796:^80 (1977); Grunstein and Hogness, Proc. Natl. 

15 Acad, Sci, U.S.A. 72:3961 (1975)). Those DNA fragments with substantial 
homology to the probe will hybridize. It is also possible to identify the 
appropriate fragment by restriction enzyme dige5tion(s) and comparison of 
fragment sizes with those expected according to a known restriction map if such 
is available or by DNA sequence analysis and comparison to the known 

20 nucleotide sequence of MTSP polypeptide. Further selection can be carried out 
on the basis of the properties of the gene. Alternatively, the presence of the 
gene can be detected by assays based on the physical, chemical, or 
immunological properties of its expressed product. For example, cDNA clones, 
or DNA clones which hybrid-select the proper mRNA, can be selected which 

25 produce a protein that, e.g., has similar or identical electrophoretic migration, 
isolectric focusing behavior, proteolytic digestion maps, antigenic properties, 
serine protease activity. If an anti-MTSP polypeptide antibody is available, the 
protein can be identified by binding of labeled antibody to the putatrvely MTSP 
polypeptide synthesizing clones, in an ELISA (enzyme-linked immunosorbent 

30 assay)-type procedure. 

Alternatives to isolating the MTSP20 polypeptide genomic DNA include, 
but are not limited to, chemically synthesizing the gene sequence from a known 



/ 
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sequence or making cDNA to the mRNA that encodes the MTSP polypeptide. 
For example, RNA for cDNA cloning o1 the MTSP polypeptide gene can be 
isolated from cells expressing the protein. The identified and isolated nucleic 
acids then can be inserted into an appropriate cloning vector. A large number of 
5 vector-host systems known in the art can be used. Possible vectors include, but 
are not limited to, plasmids or modified viruses, but the vector system must be 
compatible with the host cell used. Such vectors include, but are not limited to, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 or pUC 
plasmid derivatives or the Bluescript vector (Stratagene, La Jolla, CA). The 

10 insertion into a cloning vector can, for example, be accomplished by ligating the 
DNA fragment into a cloning vector which has complementary cohesive termini. 
If the complementary restriction sites used to fragment the DNA are not present 
in the cloning vector, the ends of the DNA molecules can be enzymatically 
modified. Alternatively, any site desired can be produced by ligating nucleotide 

1 5 sequences (linkers) onto the DNA termini; these ligated linkers can include 
specific chemically synthesized oligonucleotides encoding restriction 
endonuclease recognition sequences. In an alternative method, the cleaved 
vector and MTSP polypeptide gene can be modified by homopolymeric tailing. 
Recombinant molecules can be introduced into host cells via transformation, 

20 transfection, infection, electroporation, calcium precipitation and other methods, 
so that many copies of the gene sequence are generated. 

In specific embodiments, transformation of host cells with recombinant 
DNA molecules that incorporate the isolated MTSP polypeptide gene, cDNA, or 
synthesized DNA sequence enables generation of multiple copies of the gene. 

25 Thus, the gene can be obtained in large quantities by growing transform ants, 
isolating the recombinant DNA molecules from the transformants and, when 
necessary, retrieving the inserted gene from the isolated recombinant DNA. 
Identification and isolation of nucleic acid encoding an exemplary MTSP20 is 
described in the EXAMPLES. 
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E. Vectors, plasmids and cells that contain nucleic acids encoding an MTSP 
polypeptide or protease domain thereof and expression of MTSP 
polypeptides 

Vectors and cells 

5 For recombinant expression of one or more of the MTSP polypeptides, the 

nucleic acid containing all or a portion of the nucleotide sequence encoding the 
MTSP polypeptide can be inserted into an appropriate expression vector, i.e.. a 
vector that contains the necessary elements for the transcription and translation 
of the inserted protein coding sequence. The necessary transcriptional and 

10 translational signals can also be supplied by the native promoter for MTSP 
genes, and/or their flanking regions. 

Also provided are vectors that contain nucleic acid encoding the MTSPs. 
Cells containing the vectors are also provided. The cells include eukaryotic and 
prokaryotic cells, and the vectors are any suitable-for use therein. 

1 5 Prokaryotic and eukaryotic cells, including endothelial cells, containing the 

vectors are provided. Such cells include bacterial cells, yeast cells, fungal cells, 
plant cells, insect cells and animal cells. The cells are used to produce an MTSP 
polypeptide or protease domain thereof by (a) growing the above-described cells 
under conditions whereby the encoded MTSP polypeptide or a protease domain 

20 of the MTSP polypeptide is expressed by the cell, and then (b) recovering the 
expressed polypeptide. 

In one embodiment, vectors that include a sequence of nucleotides that 
encode a polypeptide that has protease activity and contains all or a portion of 
only a protease domain, or multiple copies thereof or a fragment with two 

25 protease domains, of an MTSP20 polypeptide are provided. Also provided are 
vectors that Include a sequence of nucleotides that encodes a protease domain 
and additional portions of an MTSP protein up to and including a full length SP 
protein, as well as multiple copies thereof. The vectors can be selected for 
expression of the SP protein or protease domain thereof in the cell or such that 

30 the SP protein is expressed as a secreted protein. Alternatively, the vectors can 
include signals necessary for secretion of encoded proteins. When a protease 
domain is expressed the nucleic acid is linked to nucleic acid encoding a 
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secretion signal, such as the Saccharomyces cerev/siae a mating factor signal 
sequence or a portion thereof, or the native signal sequence. 

A variety of host-vector systems can be used to express the protein 
coding sequence. These include but are not limited to mammalian cell systems 
5 infected with virus [e.g. vaccinia virus, adenovirus, etc.); insect cell systems 
infected with virus [e.g. baculovirus); microorganisms such as yeast containing 
yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, 
or cosmid DNA. The expression elements of vectors vary in their strengths and 
specificities. Depending on the host-vector system used, any one of a number 

10 of suitable transcription and translation elements can be used. 

Any methods known to those of skill in the art for the insertion of nucleic 
acid fragments into a vector can be used to construct expression vectors 
containing a chimeric gene containing appropriate transcriptionai/translational 
control signals and protein coding sequences. These methods can include in 

15 vitro recombinant DNA and synthetic techniques and in vivo recombinants 

(genetic recombination). Expression of nucleic acid sequences encoding MTSP 
polypeptide, or domains, derivatives, fragments or homologs thereof, can be 
regulated by a second nucleic acid sequence so that the genes or fragments 
thereof are expressed in a host transformed with the recombinant DNA 

20 molecule(s). For example, expression of the proteins can be controlled by any 
promoter/enhancer known in the art. In a specific embodiment, the promoter is 
not native to the genes for MTSP polypeptide. Promoters which can be used 
include but are not limited to the SV40 early promoter (Bernoist and Chambon, 
Nature 250:304-310 (1981)), the promoter contained in the 3' long terminal 

25 repeat of Rous sarcoma virus {Yamamoto et al., Ce/f 22:787-797 (1980)), the 
herpes thymidine kinase promoter {Wagner et al., Proc. Natl. Acad. Set. USA 
75:1441-1445 (1981)), the regulatory sequences of the metallothronein gene 
(Brinster et al.. Nature 256:39-42 (1982)); prokaryotic expression vectors such 
as the >?-lactamase promoter (Vllla-Kamaroff et al., Proc. Natl. Acad. Sci. USA 

30 75:3727-3731 1978)) or the tac promoter (DeBoer et al., Proc. Nati. Acad. Sci. 
USA 50:21-25 (1983)); see also "Useful Proteins from Recombinant Bacteria": in 
Scientific American 242:79-94 (1980)); plant expression vectors containing the 
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nopaline synthetase promoter (Herrar-Estrella et al.. Nature J02:209-213 (1984)) 
or the cauliflower mosaic virus 35S RNA promoter (Garder et al.. Nucleic Acids 
Res. S:2871 (1981)), and the promoter of the photosynthetic enzyme ribulose 
bisphosphate carboxylase (Herrera-Estrella et al., Nature 310A 15-120 (1984)); 
5 promoter elements from yeast and other fungi such as the Gal4 promoter, the 
alcohol dehydrogenase promoter, the phosphoglycerol kinase promoter, the 
alkaline phosphatase promoter, and the following animal transcriptional control 
regions that exhibit tissue specificity and have been used in transgenic animals: 
eiastase I gene control region which is active in pancreatic acinar cells (Swift et 

10 al., Ce// 35:639-646 (1984); Ornitz et al., Coid Spring Harbor Symp. Quant, Biol. 
50:399-409 (1986); MacDonald, Hepatology 7:425-515 (1987)); insulin gene 
control region which is active in pancreatic beta cells (Hanahan et al.. Nature 
3/5:115-122 (1985)), immunoglobulin gene control region which is active in 
lymphoid cells (Grosschedl et al., Cei/ 38:647-668 (1984); Adams et a!.. Nature 

15 3/5:533-538 (1985); Alexander et al.. Mo/. Ce/i Bio/. 7:1436-1444 (1987)), 

mouse mammary tumor virus control region which is active in testicular, breast, 
lymphoid and mast cells (Leder et al., Ce// 45:485-495 (1986)), albumin gene 
control region which is active in liver (Pinckert et al.. Genes and Deve/. 7:268- 
276 (1987)), aipha-fetoprotein gene control region which is active in liver 

20 (Krumlauf et al., /Wo/. Ce/A Bio/. 5:1639-1648 (1985); Hammer et al., Science 

235:53-58 1987)), alpha-1 antitrypsin gene control region which is active in liver 
(Kelsey et al., Genes and Deve/. /: 161 -171 (1987)), beta globin gene control 
region which is active in myeloid cells (Mogram et al.. Nature 3/5:338-340 
(1985); Kollias et al., Ce// 45:89-94 (1986)), myelin basic protein gene control 

25 region which is active in oligodendrocyte cells of the brain (Readhead et al., Ce// 
45:703-712 (1987)), myosin liight chain-2 gene control region which is active in 
skeletal muscle (Sani, Nature 3/4:283-286 (1985)), and gonadotrophic releasing 
hormone gene control region which is active in gonadotrophs of the 
hypothalamus (Mason et al.. Science 234:1372-1378 (1986)). 

30 In a specific embodiment, a vector is used that contains a promoter 

operably linked to nucleic acids encoding an MTSP polypeptide, or a domain, 
fragment, derivative or homolog, thereof, one or more origins of replication, and 
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optionaltv, one or more selectable markers {e.g., an antibiotic resistance gene). 
Expression vectors containing the coding sequences, or portions thereof, of an 
MTSP polypeptide, is made, for example, by subcloning the coding portions into 
the EcoRI restriction site of each of the three pGEX vectors (glutathione S- 
5 transferase expression vectors (Smith and Johnson, Gene 7:31-40 (1988)). This 
allows for the expression of products in the correct reading frame. Exemplary 
vectors and systems for expression of the protease domains of the MTSP20 
polypeptides include the well-known P/chia vectors (available, for example, from 
Invitrogen, San Diego, CA), particularly those designed for secretion of the 

10 encoded proteins. The protein can also be expressed cytoplasmically, such as in 
the inclusion bodies. One exemplary vector is described in the EXAMPLES. 

Plasmids for transformation of B, coli cells, include, for example, the pET 
expression vectors (see, U.S patent 4,952,496; available from NOVAGEN, 
Madison, Wl; see, also literature published by Novagen describing the system). 

15 Such plasmids include pET 1 la, which contains the T7lac promoter, T7 

terminator, the inducible E. coii lac operator, and the lac repressor gene; pET 
1 2a-c, which contains the T7 promoter, T7 terminator, and the E. cofi ompT 
secretion signal; and pET 15b and pET19b (NOVAGEN, Madison, Wl), which 
contain a His-Tag™ leader sequence for use in purification with a His column and 

20 a thrombin cleavage site that permits cleavage following purification over the 
column; the T7-lac promoter region and the T7 terminator. 

The vectors are introduced into host cells, such as Pichia cells and 
bacterial cells, such as £ coli, and the proteins expressed therein. Exemplary 
Pichia strains, include, for example, GS1 15. Exemplary bacterial hosts contain 

25 chromosomal copies of DNA encoding T7 BNA polymerase operably linked to an 
inducible promoter, such as the lacUV promoter (see, U.S. Patent No. 
4,952,496). Such hosts include, but are not limited to, the lysogenic E. coli 
strain BL21iDE3). 
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Expression and production of proteins 
The MTSP domains, derivatives and analogs can be produced by various 
methods known In the art. For example, once a recombinant cell expressing an 
MTSP polypeptide, or a domain, fragment or derivative thereof, is Identified, the 
5 individual gene product can be isolated and analyzed. This is achieved by 
assays based on the physical and/or functional properties of the protein, 
including, but not limited to, radioactive labeling of the product followed by 
analysis by get electrophoresis, immunoassay, cross-linking to marker-labeled 
product, and assays of proteolytic activity. 

1 0 The MTSP polypeptides can be isolated and purified by standard methods 

known in the art {either from natural sources or recombinant host celts 
expressing the complexes or proteins), including but not restricted to column 
chromatography {e.g., ion exchange, affinity, gel exclusion, reversed-phase high 
pressure and fast protein liquid), differential centrifugation, differential solubility, 

15 or by any other standard technique used for the purification of proteins. 

Functional properties can be evaluated using any suitable assay known in the 
art. 

Alternatively, once an MTSP polypeptide or Its domain or derivative is 
identified, the amino acid sequence of the protein can be deduced from the 

20 nucleotide sequence of the gene which encodes it. As a result, the protein or its 
domain or derivative can be synthesized by standard chemical methods known in 
the art (e.g. see Hunkapitler et at, Nature 3 70:105-111 {1984)). 

Manipulations of MTSP polypeptide sequences can be made at the protein 
level. Also contemplated herein are MTSP polypeptide proteins, domains 

25 thereof, derivatives or analogs or fragments thereof, which are differentially 
modified during or after translation, e.g., by glycosylation, acetylation, 
phosphorylation, amidatlon, derivatization by known protecting/blocking groups, 
proteolytic cleavage, linkage to an antibody molecule or other cellular ligand. 
Any of numerous chemical modifications can be carried out by known 

30 techniques, including but not limited to specific chemical cleavage by cyanogen 
bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH^, acetylation. 
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formylation, oxidation, reduction, metabolic synthesis in the presence of 
tunicamycin and other such agents. 

In addition, domains, analogs and derivatives of an MTSP polypeptide can 
be chemically synthesized. For example, a peptide corresponding to a portion of 
5 an MTSP polypeptide, which includes the desired domain or which mediates the 
desired activity in vitro can be synthesized by use of a peptide synthesizer. 
Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs 
can be introduced as a substitution or addition into the MTSP polypeptide 
sequence. Non-classical amino acids include but are not limited to the D-isomers 

10 of the common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 
2-aminobutyric acid, e-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino 
isobutyric acid, 3-amlno propionoic acid, ornithine, norleucine, norvaline, 
hydroxyproiine, sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, fS-alanine, fluoro-amino acids, designer amino 

1 5 acids such as B-methyl amino acids, Ca-methyl amino acids, Na-methyl amino 
acids, and amino acid analogs in general. Furthermore, the amino acid can be D 
(dextrorotary) or L (levorotary). 

In cases where natural products are suspected of being mutant or are 
isolated from new species, the amino acid sequence of the MTSP polypeptide 

20 isolated from the natural source, as well as those expressed in vitro, or from 
synthesized expression vectors in vivo or in vitro, can be determined from 
analysis of the DNA sequence, or alternatively, by direct sequencing of the 
isolated protein. Such analysis can be performed by manual sequencing or 
through use of an automated amino acid sequenator. 

25 Modifications 

A variety of modifications of the MTSP polypeptides and domains are 
contemplated herein. An MTSP-encoding nucleic acid molecule can be modified 
by any of numerous strategies known in the art (Sambrook et at. (1990), 
Molecular Cionirjg^ A Laboratory Manuai, 2d ed.. Cold Spring Harbor Laboratory, 

30 Cold Spring Harbor, New York). The sequences can be cleaved at appropriate 

sites with restriction endonuclease(s}, followed by further enzymatic modification 
if desired, isolated, and ligated in vitro. In the production of the gene encoding a 
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domain, derivative or analog of MTSP, care should be taken to ensure that the 
modified gene retains the original translationat reading frame, uninterrupted by 
translational stop signals, in the gene region where the desired activity is 
encoded. 

5 Additionally, the MTSP-encoding nucleic acid molecules can be mutated 

in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
termination sequences, or to create variations in coding regions and/or form new 
restriction endonuclease sites or destroy pre-existing ones, to facilitate further in 
vitro modification. Also, as described herein muteins with primary sequence 

10 alterations, such as replacements of Cys residues and elimination or addition of 
glycosylation sites are contemplated; the MTSP20 that includes the sequence of 
amino acids set forth in SEQ ID No. 6 has potential glycosylation sites at 
...N77QS... and ...NeailS (SEQ ID No. 16). Mutations can be effected by any 
technique for mutagenesis known in the art, including, but not limited to, 

15 chemical mutagenesis and in vitro site-directed mutagenesis (Hutchinson et al., 
J. Biol. Chem. 253:6551-6558 (1978)), use of TAB* linkers (Pharmacia). In one 
embodiment, for example, an MTSP polypeptide or domain thereof is modified to 
include a fluorescent label. In other specific embodiments, the MTSP 
polypeptide is modified such that heterobifunctional reagents can be used to 

20 crosslink the members of a complex. 

In addition, domains, analogs and derivatives of an MTSP can be 
chemically synthesized. For example, a peptide corresponding to a portion of an 
MTSP, which Includes the desired domain or which mediates the desired activity 
in vitro can be synthesized by use of a peptide synthesizer. Furthermore, if 

25 desired, nonclassical amino acids or chemical amino acid analogs can be 

introduced as a substitution or addition into the MTSP sequence. Non-classical 
amino acids include but are not limited to the D-isomers of the common amino 
acids, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-aminobutyric acid, 
€-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino 

30 propionoic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, 
citrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, 
cyclohexylalanine, IS-alanine, fluoro-amino acids, designer amino acids such as B- 
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methyt amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino 
acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or 
L (levorotary). 
F. Screening Methods 
5 The single chain protease domains, as shown herein, can be used in a 

variety of methods to identify compounds that modulate an activity thereof. For 
SPs that exhibit higher activity or expression In tumor cells or angiogenic 
endothelial cells, compounds that inhibit the proteolytic activity are of particular 
interest. For any SPs that are active at lower levels in tumor cells or quiescent 

10 endothelial cells, compounds or agents that enhance an activity are potentially of 
interest. In all instances the identified compounds include agents that are 
candidate cancer treatments. 

Several types of assays are exemplified and described herein. It is 
understood that a protease domain or domains can be used in other assays. The 

1 5 single chain protease domains exhibit catalytic activity. As such they are 

suitable for in vitro screening assays. They can also be used in binding assays. 

The MTSP20 full length zymogens, activated enzymes, single and multi- 
chain protease domains are contemplated for use in any screening assay known 
to those of skill in the art, including those provided herein. Hence the following 

20 description, if directed to proteolytic assays is intended to apply to use of a 

single chain protease domain or a catalyticatly active portion thereof of any SP, 

including an MTSP20. Other assays, such as binding assays are provided 

herein, particularly for use with an MTSP20, including any variants, such as 

splice variants thereof. 

25 1 . Catalytic Assays for identification of agents that modulate the 

protease activity of an MTSP protein 

Methods for identifying a modulator of the catalytic activity of an SP, 

particularly a single chain protease domain or catalytically active portion thereof, 

are provided herein. The methods can be practiced by: contacting the MTSP20, 

30 a full-length zymogen or activated form, and particularly a single-chain domain 

thereof, with a substrate of the MTSP20 in the presence of a test substance, 

and detecting the proteolysis of the substrate, whereby an activity of the 
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MTSP20 is assessed, and comparing the activity to a control. For example, a 
control can be the activity of the MTSP20 assessed by contacting an MTSP20, 
including a full-length zymogen or activated form, and particularly a single-chain 
domain thereof, with a substrate of the MTSP20, and detecting the proteolysis 
5 of the substrate, whereby an activity of the MTSP20 Is assessed. The results in 
the presence and absence of the test compounds are compared. A difference in 
an activity indicates that the test substance modulates an activity of the 
MTSP20. Modutatorsjncluding activators and inhibitors, of MTSP20 activation 
cleavage are also contemplated; such assays are discussed below. 

10 ' In one embodiment a plurality of the test substances are screened 
simultaneously in the above screening method. In another embodiment, the 
MTSP20 is isolated from a target cell as a means for then identifying agents that 
are potentially specific for the target cell. 

In another embodiment, a test substance is a therapeutic compound, 

15 whereby a difference of the MTSP20 activity measured in the presence and in 
the absence of the test substance Indicates that the target cell responds to the 
therapeutic compound. 

One method includes the steps of (a) contacting the MTSP20 polypeptide 
or protease domain thereof with one or a plurality of test compounds under 

20 conditions conducive to interaction between the ligand and the compounds; and 
(b) identifying one or more compounds in the plurality that specifically binds to 
the ligand. 

Another method provided herein includes the steps of a) contacting an 
MTSP20 polypeptide or protease domain thereof with a substrate of the 

25 MTSP20 polypeptide, and detecting the proteolysis of the substrate, whereby 
the activity of an MTSP20 polypeptide is assessed; b} contacting the MTSP20 
polypeptide with a substrate of the MTSP20 polypeptide in the presence of a 
test substance, and detecting the proteolysis of the substrate, whereby an 
activity of the MTSP20 polypeptide is assessed; and c) comparing the activity of 

30 the MT5P20 polypeptide assessed in steps a) and b), whereby the activity 

measured in step a) differs from the activity measured in step b) indicates that 
the test substance modulates an activity of the MTSP20 polypeptide. 
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In another embodiment, a plurality of the test substances are screened 
simultaneously. In comparing an activity of an MTSP20 polypeptide in the 
presence and absence of a test substance to assess whether the test substance 
is a modulator of the MTSP20 polypeptide, it is unnecessary to assay the 
5 activity in parallel, although such parallel measurement is typical. It is possible 
to measure an activity of the MTSP20 polypeptide at one time point and 
compare the measured activity to a historical value of an activity of the MTSP20 
polypeptide. 

For instance, one can measure an activity of the MTSP20 polypeptide in 

1 0 the presence of a test substance and compare with historical value of an activity 
of the MTSP20 polypeptide measured previously in the absence of the test 
substance, and vice versa. This can be accomplished, for example, by providing 
the activity of the MTSP20 polypeptide on an insert or pamphlet provided with a 
kit for conducting the assay. 

1 5 Methods for selecting substrates for a particular SP are described in the 

EXAMPLES, and particular proteolytic assays are exemplified. 

Combinations and kits containing the combinations optionally including 
instructions for performing the assays are provided. The combinations include 
an MTSP20 polypeptide and a substrate of the MTSP20 polypeptide to be 

20 assayed; and, optionally reagents for detecting proteolysis of the substrate. 
The substrates, which can be chromogenic or fluorogenic molecules, including 
proteins, subject to proteolysis by a particular MTSP20 polypeptide, can be 
identified empirically by testing the ability of the MTSP20 polypeptide to cleave 
the test substrate. Substrates that are cleaved most effectively (/.e., at the 

25 lowest concentrations and/or fastest rate or under desirable conditions), are 
identified. 

Additionally provided herein is a kit containing the above-described 
combination. The kit optionally includes instructions for identifying a modulator 
of an activity of an MTSP20 polypeptide. Any MTSP20 polypeptide is 
30 contemplated as target for identifying modulators of an activity thereof. 
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2. Binding assays 

Also provided herein are methods for identification and isolation of 
agents, particularly compounds that bind to MTSP20s. The assays are designed 
to identify agents that bind to the zymogen form, a single chain isolated protease 
5 domain (or a protein, other than an MTSP20 polypeptide, that contains a 
protease domain of an MTSP20 polypeptide), and to the activated form, 
including the activated form derived from the full length zymogen or from an 
extended protease domain. The identified compounds are candidates or leads 
for identification of compounds for treatments of tumors and other disorders and 

10 diseases involving aberrant angiogenesis. The MTSP20 polypeptides used in 
the methods include any MTSP20 polypeptide as defined herein, including the 
MTSP20 single chain protease domain or proteolytically active portion thereof. 

A variety of methods are provided herein. These methods can be 
performed in solution or in solid phase reactions in which the MTSP20 

1 5 polypeptide's) or protease domain(s) thereof are linked, either directly or 

indirectly via a linker, to a solid support. Screening assays are described in the 
Examples, and these assays have been used to identify candidate compounds. 
For purposes herein, all binding ass^ays described above are provided for 
MTSP20. 

20 Methods for identifying an agent, such as a compound, that specifically 

binds to an MTSP20 single and/or two chain protease domain, a zymogen and/or 
full-length activated multi-chain (two, three or more chains) MTSP20 or two 
chain protease domain thereof are provided herein. The method can be practiced 
by (a) contacting the MTSP20 with one or a plurality of test agents under 

25 conditions conducive to binding between the MTSP20 and an agent; and (b) 
identifying one or more agents within the plurality that specifically binds to the 
MTSP20. 

For example, in practicing such methods the MTSP20 polypeptide is 
mixed with a potential binding partner or an extract or fraction of a cell or tissue 
30 under conditions that allow the association of potential binding partners with the 
polypeptide. After mixing, peptides, polypeptides, proteins or other molecules 
that have become associated with an MTSP20 are separated from the mixture. 



wo 03/004681 



PCT/US02/21208 



-86- 

The binding partner that bound to the MTSP20 can then be removed and further 
analyzed. To identify and isolate a binding partner, the entire protein, for 
instance the entire polypeptide of SEQ ID No. 6 can be used. Alternatively, a 
fragment of the protein can be used. 
5 A variety of methods can be used to obtain cell or tissue extracts or body 

fluids, such as blood, serum, urine, sweat, synovial fluid, CSF and other such 
fluids. For example, cells and tissues can be disrupted using either physical or 
chemical disruption methods. Examples of physical disruption methods include, 
but are not limited to, sonication and mechanical shearing. Examples of 

10 chemical lysis methods include, but are not limited to, detergent lysis and 

enzyme lysis. A skilled artisan can adapt methods for preparing cellular or tissue 
extracts in order to obtain extracts for use in the present methods. 

Once an extract of a cell or tissue is prepared, the extract is mixed with 
the MTSP20 under conditions in which association of the protein with the 

15 binding partner can occur. A variety of conditions can be used, including 

conditions that resemble conditions found in the cytoplasm of a human cell, in a 
tissue, such as a tumor, or In a body fluid, such as blood. Features, such as 
osmolarlty, pH, temperature, and the concentration of extract used, can be 
varied to optimize the association of the protein with the binding partner. 

20 Similarly, methods for isolation of molecules of interest from body fluids are 
known. 

After mixing under appropriate conditions, the bound complex is 
separated from the mixture. A variety of techniques can be used to separate the 
mixture. For example, antibodies specific to an MTSP20 can be used to 

25 immunoprecipltate the binding partner complex. Alternatively, standard chemical 
separation techniques such as chromatography and density/sediment 
centrifugation can be used. After removing the non-associated cellular 
constituents in the extract, the binding partner can be dissociated from the 
complex using conventional methods. For example, dissociation can be 

30 accomplished by altering the salt concentration or pH of the mixture. 

To aid in separating associated binding partner pairs from the mixed 
extract, the MTSP20 can be immobilized on a solid support. For example, the 
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protein can be attached to a nitrocellulose matrix or acrylic beads. Attachment 
of the protein or a fragment thereof to a solid support aids in separating 
peptide/binding partner pairs from other constituents found in the extract. 
Alternatively MTSP20 conjugates, such as those provided herein, can be used to 
5 separte binding partners from an extract. The Identified binding partners can be 
either a single protein or a complex made up of two or more proteins. 

Ahernatively, the nucleic acid molecules encoding the single chain 
proteases can be used in a yeast two-hybrid system. The yeast two-hybrid 
system has been used to identify other protein partner pairs and can readily be 

10 adapted to employ the nucleic acid molecules herein described. 

Another in vitro binding assay^ particularly for an MTSP20, uses a 
mixture of a polypeptide that contains at least the catalytic domain of one of 
these proteins and one or more candidate binding targets or substrates. After 
incubating the mixture under appropriate conditions, the ability of the MTSP20 

15 or a polypeptide fragment thereof containing the catalytic domain to bind to or 
interact with the candidate substrate is assessed. For cell-free binding assays, 
one of the components includes or is coupled to a detectable label. The label 
can provide for direct detection, such as radioactivity, luminescence, optical or 
electron density, etc., or indirect detection such as an epitope tag, an enzyme, 

20 etc, A variety of methods can be employed to detect the label depending on the 
nature of the label and other assay components. For example, the label can be 
detected bound to the solid substrate or a portion of the bound complex 
containing the label can be separated from the solid substrate, and the label 
thereafter detected. 

25 3. Detection of signal transduction 

MTSP20, which is a transmembrane protein, can be involved directly or 
indirectly in signal transduction directly as a cell surface receptor or indirectly by 
activating proteins, such as pro-growth factors that can initiate signal 
transduction. 

30 In addition, secretion of MTSP20, such as an extracellular domain or 

region of MTSP20, can be involved in signal transduction either directly by 
binding to or interacting with a cell surface receptor or indirectly by activating 
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proteins, such as pro-growth factors that can initiate signal transduction. 
Assays for assessing signal transduction are well known to those of skill in the 
art, and can be adapted for use with the MTSP20 polvpeptide. 

Assays for identifying agents that affect or alter signal transduction 
5 mediated directly or indirectly, such as via activation- of a pro-growth factor, by 
an MTSP20, particularly the full length or a sufficient portion to anchor the 
extracellular donnain or a functional portion thereof of an MTSP20 on the surface 
of a cell are provided. Such assays, include, for example, transcription based 
assays in which modulation of a transduced signal is assessed by detecting an 
10 effect on an expression from a reporter gene {see, e.g.. U.S. Patent No. 
5,436,128). 

4. Methods for Identifying Agents that Modulate the Expression a 
Nucleic Acid Encoding an MTSP20 

Another embodiment provides methods for identifying agents that 

15 modulate the expression of a nucleic acid encoding an MTSP20. Such assays 
use any available means of monitoring for changes in the expression level of the 
nucleic acids encoding an MTSP20. 

In one assay format, cell lines that contain reporter gene fusions between 
the open reading frame of MTSP20 or a domain thereof, particularly a protease 

20 domain and any assayable fusion partner can be prepared. Numerous assayable 
fusion partners are known and readily available including the firefly luciferase 
gene and the gene encoding chloramphenicol acetyltransferase (Alam et a/., 
Anai. Biochem. 188: 245-54 (1990)). Cell lines containing the reporter gene 
fusions are then exposed to the agent to be tested under appropriate conditions 

25 and time. Differential expression of the reporter gene between samples exposed 
to the agent and control samples identifies agents which modulate the 
expression of a nucleic acid encoding an MTSP20. 

Additional assay formats can be used to monitor the ability of the agent 
to modulate the expression of a nucleic acid encoding an MTSP20. For 

30 instance, mRNA expression can be monitored directly by hybridization to the 

nucleic acids. Cell lines are exposed to the agent to be tested under appropriate 
conditions and time and total RNA or mRNA is isolated by standard procedures 
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(see, e.g., Sambrook et al. (1989) MOLECULAR CLONING: A LABORATORY 
MANUAL, 2nd Ed. Cold Spring Harbor Laboratory Press). Probes to detect 
differences in RNA expression levels between cells exposed to the agent and 
control cells can be prepared from the nucleic acids. It is typical, but not 
5 necessary, to design probes which hybridize only with target nucleic acids under 
conditions of high stringency. Only highly complennentary nucleic acid hybrids 
form under conditions' of high stringency. Accordingly, the stringency of the 
assay conditions determines the amount of complementarity which should exist 
between two nucleic acid strands in order to form a hybrid. Stringency should 

10 be chosen to maximize the difference in stability between the probe:target hybrid 
and potential probe:non-target hybrids. 

Probes can be designed from the nucleic acids through methods known in 
the art. For instance, the G -t- C content of the probe and the probe length can 
affect probe binding to its target sequence. Methods to optimize probe 

15 specificity are commonly available (see, e.g., Sambrook et al. (1989) 

MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. Cold Spring 
Harbor Laboratory Press); and Ausubel et aL (1995) CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, Greene Publishing Co., NY). 

Hybridization conditions are modified using known methods (see, e.g., 

20 Sambrook etal. (1989) MOLECULAR CLONING: A LABORATORY MANUAL. 
2nd Ed. Cold Spring Harbor Laboratory Press); and Ausubel et at, (1995) 
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Co., NY), 
as required for each probe. Hybridization of total cellular RNA or RNA enriched 
for polyA RNA can be accomplished in any available format. For instance, total 

25 cellular RNA or RNA enriched for polyA RNA can be affixed to a solid support, 
and the solid support exposed to at least one probe comprising at least one, or 
part of one of the nucleic acid molecules under conditions in which the probe 
specifically hybridizes. Alternatively, nucleic acid fragments comprising at least 
one, or part of one of the sequences can be affixed to a solid support, such as a 

30 porous glass wafer. The glass wafer can then be exposed to total cellular RNA 
or polyA RNA from a sample under conditions in which the affixed sequences 
specifically hybridize. Such glass wafers and hybridization methods are widely 



wo 03/004681 



PCT/US02/21208 



-90- 

available, for example, those disclosed by Beattie (WO 95/1 1755). By 
examining for the ability of a given probe to specifically hybridize to an RNA 
sample from an untreated cell papulation and from a cell population exposed to 
the agent, agents which up or down regulate the expression of a nucleic acid 
5 encoding the MTSP20 polypeptide, are identified. 

In one format, the relative amounts of a protein between a cell population 
that has been exposed to the agent to be tested compared to an un-exposed 
control cell population can be assayed le.g., a prostate cancer cell line, a lung 
cancer cell line, a colon cancer cell line or a breast cancer cell line). In this 

10 format, probes, such as specific antibodies, are used to monitor the differential 
expression or level of activity of the protein in the different cell populations or 
body fluids. Cell lines or populations, tissues or body fluids are exposed to the 
agent to be tested under appropriate conditions and time. Cellular or tissue 
lysates or body fluids can be prepared from the exposed cell line or tissue or 

1 5 population and a control, unexposed cell line or population, or tissue or 

unexposed body fluid. The lysates or body fluids are then analyzed with the 
probe. 

For example, N- and C- terminal fragments of the MTSP20 can be 
expressed in bacteria and used to search for proteins which bind to these 

20 fragments. Fusion proteins, such as His-tag or GST fusion to the N- or C- 

terminal regions of the MTSP20 can be prepared for use as a substrate. These 
fusion proteins can be coupled to, for example, Glutathione-Sepharose beads and 
then probed with tissue or cell lysates or body fluids. Prior to lysis, the tissues, 
cells or body fluids can be treated with a candidate agent which can modulate 

25 an MTSP20 or proteins that interact with domains thereon. Lysate proteins 
binding to the fusion proteins can be resolved by SDS-PAGE, isolated and 
identified by protein sequencing or mass spectroscopy, as is known in the art. 

Antibody probes are prepared by immunizing suitable mammalian hosts in 
appropriate immunization protocols using the peptides, polypeptides or proteins 

30 if they are of sufficient length [e.g., 4, 5, 6, 7, 8, 9, 10, 11 , 1 2, 1 3. 14, 1 5, 20, 
25, 30, 35, 40 or more consecutive amino acids the MTSP20 polypeptide) or If 
required to enhance immunogenicity, conjugated to suitable carriers. Methods 
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for preparing immunogenic conjugates with carriers, such as bovine serum 
albumin (BSA), keyhole limpet hemocyanin (KLH), or other carrier proteins are 
well known in the art. In some circumstances, direct conjugation using, for 
example, carbodlimide reagents can be effective; in other instances linking 
5 reagents such as those supplied by Pierce Chemical Co., Rockford, IL, can be 
desirable to provide accessibility to the hapten. Hapten peptides can be 
extended at either the amino or carboxy terminus with a Cys residue or 
interspersed with cysteine residues, for example, to facilitate linking to a carrier. 
Administration of the immunogens is conducted generally by injection over a 

10 suitable time period and with use of suitable adjuvants, as is generally 

understood in the art. During the immunization schedule, titers of antibodies are 
taken to determine adequacy of antibody formation. 

Anti-peptide antibodies can be generated using synthetic peptides 
corresponding to, for example, the carboxy terminal amino acids of the MTSP20. 

15 Synthetic peptides can be as small as 1-3 amino acids in length, generally at 
least 4 or more amino acid residues long. The peptides can be coupled to KLH 
using standard methods and can be immunized into animals, such as rabbits or 
ungulates. Polyclonal antibodies can then be purified, for example using Actigel 
beads containing the covalently bound peptide. 

20 While the polyclonal antisera produced in this way can be satisfactory for 

some applications, for pharmaceutical compositions, use of monoclonal 
preparations are generally used. Immortalized cell lines which secrete the 
desired monoclonal antibodies can be prepared using the standard method of 
Kohler etaf,, {Nature 256: 495-7 (1975)) or modifications which effect 

25 immortalization of lymphocytes or spleen cells, as is generally known. The 
immortalized cell lines secreting the desired antibodies are screened by 
immunoassay in which the antigen is the peptide hapten, polypeptide or protein. 
When the appropriate immortalized cell culture secreting the desired antibody is 
identified, the cells can be cultured either in vitro or by production in vivo via 

30 ascites fluid. Of particular interest, are monoclonal antibodies that recognize a 
catalytic domain or an activation cleavage site region(s) of an MTSP20. 
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Additionally, the zymogen or a multi-chain form of the MTSP20 can be 
used to make monoclonal antibodies that recognize conformation epitopes. The 
desired monoclonal antibodies are then recovered from the culture supernatant 
or from the ascites supernatant. Fragments of the monoclonals or the polyclonal 
5 antisera which contain the immunologically significant portion can be used as 
antagonists, as well as the intact antibodies. Use of immunologically reactive 
fragments, such as the Fab, Fab', of FCab'lj fragments are often used, especially 
In a therapeutic context, as these fragments are generally less immunogenic than 
the whole immunoglobulin. The antibodies or fragments can also be produced. 
1 0 Regions of antibodies or fragments that bind specifically to the desired epitopes 
of MTSP20 polypeptides also can be produced In the context of chimeras with 
multiple species origin. 

Agents that are assayed in the above method can be randomly selected 
or rationally selected or designed. The agents can be, as examples, peptides, 
1 5 small molecules, and carbohydrates. A skilled artisan can recognize that there is 
no limit as to the structural nature of the agents. 

The peptide agents can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, 
the peptides can be produced recombinantly from nucleic acid. The nucleic acid 
20 encoding these peptides can be synthesized using commercially available 

oligonucleotide synthesis instrumentation and produced recombinantly using 
standard recombinant production systems. The production using solid phase 
peptide synthesis is necessitated if non-gene-encoded amino acids are to be 
included. 

25 G. Assay formats and selection of test substances that modulate at least 
one activity of an IVITSP20 polypeptide 

Methods for identifying agents that modulate at least one activity of an 
MTSP20 are provided. The methods include phage display and other methods 
for assessing alterations in an activity of an MTSP20. Such methods or assays 
30 can use any means of monitoring or detecting the desired activity. A variety of 
formats and detection protocols are known for performing screening assays. 
Any such formats and protocols can be adapted for identifying modulators of 
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MTSP20 polypeptide activities. The following includes a discussion of exemplary 
protocols. 

1 . High throughput screening assays 

Although the above-described assay can be conducted where a single 
5 MTSP20 polypeptide is screened, and/or a single test substance is screened in 
one assay, the assay typically is conducted in a high throughput screening 
mode, i.e.. a plurality of the SP proteins are screened against and/or a plurality 
of the test substances are screened simultaneously {See generally. High 
Throughput Screening: The Discovery of Bioactive Substances (Devlin, Ed.> 

10 Marcel Dekker, 1997; Sittampalam et al., Curr. Opin, Chem. Biol., r:384-91 
(1997); and Silverman et al., Curr. Opin. Chem. Biol., 2:397-403 (1998)), For 
example, the assay can be conducted in a multi-well {e.g. , 24-, 48-, 96-, 384-, 
1536-weH or higher density), chip or array format. 

High-throughput screening (NTS) is the process of testing a large number 

15 of diverse chemical structures against disease targets to identify "hits" 

(Sittampalam et al., Curr. Opin. Chem. Biol., 7:384-91 (1997)). Current state-of- 
the-art HTS operations are highly automated and computerized to handle sample 
preparation, assay procedures and the subsequent processing of large volumes 
of data. 

20 Detection technologies employed in high-throughput screens depend on 

the type of biochemical pathway being investigated (Sittampalam et al., Curr. 
Opin. Chem, Biol., f:3 84-91 (1997)). These methods include, radiochemical 
methods, such as the scintillation proximity assays (SPA), which can be adapted 
to a variety of enzyme assays (Lerner et al., J. Biomot. Screening, /:1 35-143 

25 (1996); Baker et al.. Anal. Biochem., 23S:20-24 (1 996); Baum et al.. Anal. 

Biochem., 237:129-134 (1996); and Sullivan et al., J. Biomol. Screening 2:19- 
23 (1997)) and protein-protein interaction assays (Braunwalder et al., J. Biomol. 
Screening 7:23-26 (1996); Sonatore et al., Anal. Biochem. 240:289-297 (1996); 
and Chen et al., J. Biol. Chem. 27 ;:25308-2531 5 (1996)), and non-isotopic 

30 detection methods, including but not limited to, colorimetric and luminescence 
detection methods, resonance energy transfer (RET) methods, time-resolved 
fluorescence (HTRF) methods, cell-based fluorescence assays, such as 
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fluorescence resonance energy transfer (FRET) procedures (see, e.^., Gonzalez et 
al., Bhphys. J., ^9:1272-1280 (1995)), fluorescence polarization or anisotropy 
methods (see, e.g., Jameson et al.. Methods Bnzymol. 246:283-300 (1995); 
Jolley, J, BiomoL Screening /:33-38 (1996); Lynch et al.. Anal, Biochem. 
5 247:77-82 (1997)), fluorescence correlation spectroscopy (FCS) and other such 
methods. 

2. Test Substances 
Test compounds, including small molecules, antibodies, proteins, nucleic 
acids, peptides, natural products, extracts containing natural products and 
10 libraries and collections thereof, can be screened in the above-described assays 
and assays described below to Identify compounds that modulate an activity of 
an MTSP20 polypeptide. Rational drug design methodologies that rely on 
computational chemistry can be used to screen and identify candidate 
compounds. 

1 5 The compounds identified by the screening methods include inhibitors, 

including antagonists, and can be agonists. Compounds for screening include 
any compounds and collections of compounds available, known or that can be 
prepared. 

a. Selection of Compounds 

20 Compounds can be selected for their potency and selectivity of inhibition 

of serine proteases, especially an MTSP20 polypeptide. As described herein, 
and as generally known, a target serine protease and its substrate are combined 
under assay conditions permitting reaction of the protease with its substrate. 
The assay is performed in the absence of test compound, and in the presence of 

25 increasing concentrations of the test compound. The concentration of test 
compound at which 50% of the serine protease activity is inhibited by the test 
compound is the ICgo value (Inhibitory Concentration) or EC50 (Effective 
Concentration) value for that compound. Within a series or group of test 
compounds, those having lower IC50 or EC50 values are considered more potent 

30 inhibitors of the serine protease than those compounds having higher IC50 or 
EC50 values. The IC50 measurement is often used for more simplistic assays, 
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whereas the EC^o is often used for more complicated assays, such as those 
employing cells. 

Typically candidate compounds have an ICso value of 100 nM or less as 
measured In an in vitro assay for inhibition of MTSP20 polypeptide activity. The 
5 test compounds also are evaluated for selectivity toward a serine protease. As 
described herein, and as generally knov\^n, a test compound is assayed for its 
potency toward a panel of serine proteases and other enzymes and an IC50 value 
or EC50 value is determined for each test compound in each assay system. A 
compound that demonstrates a low IC50 value or EC^ value for the target 

10 enzyme, e.g., MTSP20 polypeptide, and a higher IC50 value or EC50 value for 
other enzymes within the test panel [e.g., urokinase tissue plasminogen 
activator, thrombin, Factor Xa), is considered to be selective toward the target 
enzyme. Generally, a compound is deemed selective if its ICsg value or EC^q 
value in the target enzyme assay is at least one order of magnitude less than the 

15 next smallest IC50 value or EC^q value measured in the selectivity panel of 
enzymes. 

Compounds are also evaluated for their activity in vivo. The type of 
assay chosen for evaluation of test compounds depends on the pathological 
condition to be treated or prevented by use of the compound, as well as the 

20 route of administration to be evaluated for the test compound. 

For instance, to evaluate an activity of a compound to reduce tumor 
growth through inhibition of MTSP20 polypeptide, the procedures described by 
Jankun et al., Cane. Res. 57:559-563 (1997) to evaluate PAI-1 can be 
employed. Briefly, the ATCC cell lines DU145 and LnCaP are injected into SCID 

25 mice. After tumors are established, the mice are given test compound according 
to a dosing regime determined from the compound's in vitro characteristics. The 
Jankun et ai. compound was administered in water. Tumor volume 
measurements are taken twice a week for about five weeks. A compound is 
deemed active if an animal to which the compound was administered exhibited 

30 decreased tumor volume, as compared to animals receiving appropriate control 
compounds. 
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Another in vivo exiDerimental model designed to evaluate the effect of p- 
aminobenzamidine, a swine protease inhibitor, on reducing tumor volume is 
described by Bilistrom exa\.. Int. J. Cancer 67:542-547 (1995). 

To evaluate the ability of a compound to reduce the occurrence of, or 
5 inhibit, metastasis, the procedures described, for example, by Kobayashi et al. 
Int. J. Cane. 57:727-733d (1994) can be employed. Briefly, a murine xenograft 
selected for high lung colonization potential is injected into C57B1/6 mice i.v. 
(experimental metastasis) or s.c. into the abdominal wall (spontaneous 
metastasis). Various concentrations of the compound to be tested can be 

10 admixed with the tumor cells in Matrigel prior to injection. Daily i.p. injections of 
the test compound are made either on days 1-6 or days 7-13 after tumor 
Inoculation. The animals are sacrificed about three or four weeks after tumor 
inoculation, and the lung tumor colonies are counted. Evaluation of the resulting 
data permits a determination as to efficacy of the test compound, optimal dosing 

1 5 and route of administration. 

An activity of the tested compounds toward decreasing tumor volume 
and metastasis can be evaluated, for example, in a model described in Rabbani 
et at. (1995) Int, J, Cancer 53:840-845 in which Mat LyLu tumor cells were 
injected into the flank of Copenhagen rats. The animals were implanted with 

20 osmotic minipumps to continuously administer various doses of test compound 
for up to three weeks. The tumor mass and volume of experimental and control 
animals were evaluated during the experiment, as were metastatic growths. 
Evaluation of the resulting data permits a determination as to efficacy of the test 
compound, optimal dosing, and route of administration. Some of these authors 

25 described a related protocol in Xing et al.. Cane. Res. 57:3585-3593 (1997). 

To evaluate the anti-angiogenesis activity of a compound, a rabbit cornea 
neovascularization model can be employed (see, e.g., Avery et al. (1990) Arch. 
Ophthalmol., 705: 1474-1 476). Avery et al. describes anesthetizing New 
Zealand albino rabbits and then making a central corneal incision and forming a 

30 radial corneal pocket. A slow release prostaglandin pellet was placed In the 

pocket to induce neovascularization. Test compound was administered i.p. for 
five days, at which time the animals were sacrificed. The effect of the test 
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compound is evaluated by review of periodic photographs taken of the limbus, 
which can be used to calculate the area of neovascutar response and, therefore, 
limbal neovascularization. A decreased area of neovascularization as compared 
with appropriate controls indicates the test compound was effective at 
5 decreasing or inhibiting neovascularization. 

An angiogenesis model used to evaluate the effect of a test compound in 
preventing angiogenesis is described by Min et a/. Cane. fies. 55:2428-2433 
(1996). C57BL6 mice receive subcutaneous injections of a Matrigel mixture 
containing bFGF, as the angiogenesis-inducing agent, with and without the test 

10 compound. After five days, the animals are sacrificed and the Matrigel plugs, in 
which neovascularization can be visualized, are photographed. An experimental 
animal receiving Matrigel and an effective dose of test compound exhibits less 
vascularization than a control animal or an experimental animal receiving a less- 
or non-effective does of compound. 

15 An tn vivo system designed to test compounds for their ability to limit the 

spread of primary tumors is described by Crowley et al., Proc. Nati Acad. Sci. 
50:5021-5025 (1993). Nude mice are injected with tumor cells (PCS) 
engineered to express CAT (chloramphenicol acetyltransf erase). Compounds to 
be tested for their ability to decrease tumor size and/or metastases are 

20 administered to the animals, and subsequent measurements of tumor size and/or 
metastatic growths are made. In addition, the level of CAT detected in various 
organs provides an indication of the ability of the test compound to inhibit 
metastasis; detection of less CAT in tissues of a treated animal versus a control 
animal indicates less CAT-expressing cells migrated to that tissue. 

25 /n vivo experimental models designed to evaluate the inhibitory potential 

of test serine protease inhibitors, using a tumor cell line F3II known to be highly 
invasive (see, e.g., Alonso era/. (1996) Breast Cane. Res. Treat 40:209-223) 
are provided. Alonso describes in vivo studies for toxicity determination, tumor 
growth, invasiveness, spontaneous metastasis, experimental lung metastasis, 

30 and an angiogenesis assay. 

The CAM model (chick embryo chorioallantoic membrane model; 
Ossowski (1988) m/. Ce// Bio/. 707:2437-2445), provides another method for 
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evaluating the inhibitory activity of a test compound, tn the CAM model, tumor 
cells invade through the chorioallantoic membrane containing CAM (with tumor 
cells in the presence of several serine protease inhibitors results in less or no 
invasion of the tumor cells through the membrane). Thus, the CAM assay is 
5 performed with CAM and tumor cells in the presence and absence of various 
concentrations of test compound. The invasiveness of tumor cells is measured 
under such conditions to provide an indication of the compound's inhibitory 
activity. A compound having inhibitory activity correlates with less tumor 
invasion. 

10 The CAM model is also used in a standard assay of angiogenesis (/.e., 

effect on formation of new blood vessels (Brooks et al. (1999) Methods in 
Molecular Biology /2S: 25 7-269). According to this model, a filter disc 
containing an angiogenesis inducer, such as basic fibroblast growth factor 
(bFGF) is placed onto the CAM. Diffusion of the cytokine into the CAM induces 

1 5 local angiogenesis, which can be measured in several ways such as by counting 
the number of blood vessel branch points within the CAM directly below the 
filter disc. The ability of identified compounds to inhibit cytokine-induced 
angiogenesis can be tested using this model. A test compound can either be 
added to the filter disc that contains the angiogenesis inducer, be placed directly 

20 on the membrane or be administered systemically. The extent of new blood 
vessel formation in the presence and/or absence of test compound can be 
compared using this model. The formation of fewer new blood vessels in the 
presence of a test compound would be indicative of anti-angiogenesis activity. 
Demonstration of anti-angiogenesis activity for inhibitors of an MTSP20 

25 polypeptide indicates a role in angiogenesis for that SP protein. 

b. Known serine protease Inhibitors 
Compounds for screening can be serine protease inhibitors, which can be , 
tested for their ability to inhibit an activity of an MTSP20. 

Exemplary serine protease inhibitors for use in the screening assays include, but 
30 are not limited to: Serine Protease Inhibitor 3 {SPI-3} (Chen, et al. Citokine. 

;/:856-862 (1999)); Aprotinin (lijima ef a/. J. Biochem, (Tokyo) 726:912-916 
(1999)); Kazal-type serine protease inhibitor-like proteins (Niimi, et al. Eur. J. 
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Biochem., 266:282-292 (1999)); Kunitz-type serine protease inhibitor 
(Ravichandran, S., et al.. Acta Crystallogr. D. Biol. Crystailogr. , 55:1814-1821 
(1999)); Tissue factor pathway inhibitor-2/Matrix-associated serine protease 
inhibitor (TFPI-2/MSPI1, (Liu, Y. et ah Arch, Biochem, Biophys. 370A 12-8 
5 (1999)1; Bukunin (Cui, C.Y. et aL J. Invest. Dermatol. 7 73:182-8 (1999)); 
Nafmostat mesilate (Ryo, R. et at. (1999) Vox Sang. 7ff:241-6); TPCK (Huang 
et al. (1999) Oncogene 7fi:343 1-3439); A synthetic cotton-bound serine 
protease inhibitor (Edwards (1999) ef a/. Wound Repair Regen. 7:106-18); FUT- 
175 (Sawada (1999) et al. Stroke 50:644-50); Combination of serine protease 

10 inhibitor FUT-0175 and thromboxane synthetase inhibitor OKY-046 (Kaminogo et 
at. (1998) NeuroL Med. Chir. (Tokyo) 55:704-8; discussion 708-9); the rat 
serine protease inhibitor 2.1 gene (LeCam, A., et a!., Biochem. Biophys. Res. 
Commun,, 253:31 1-4 (1998)); A new intracellular serine protease inhibitor 
expressed in the rat pituitary gland complexes with granzyme B (Hill et al. FEBS 

15 Lett. 440:361-4 (1998)); 3,4-Dichloroisocoumarin (Hammed et al. Proc. Soc. 
Exp. Biol. Med., 273:1 32-7 (1 998)); LEX032 (Bains et al. Eur. J. Pharmacol. 
355:67-72 (1998)); N-tosyl-L-phenylalanine chloromethyl ketone (Dryjanski et al. 
Biochemistry 57:14151-6 (1998)); Mouse gene for the serine protease inhibitor 
neuroserpin (P112) (Bergeref a/. Gene, 2 74:25-33 (1998)); Rat serine protease 

20 inhibitor 2.3 gene (Paul et al. Eur. J. Biochem. 254:538-46 (1998)); Ecotin 
(Yang et al. J. Mol. Biol. 273:945-57 (1998)); A 14 kDa plant-related serine 
protease inhibitor (Roch et af. Dev. Comp. Immunol. 22(1): 1-1 2 (1998)); Matrix- 
associated serine protease inhibitor TFPI-2/33 kDa MSPI (Rao et al. Int. J. 
Cancer 75:749-56 (1998)); ONO-3403 (Hiwasa ef al. Cancer Lett. 725:221-5 

25 (1998)); Bdellastasin (Moser et aL Eur. J. Biochem. 253:212-20 (1998)); Bikunin 
(Xu etaL J. Mol. BloL 275:955-66 (1998)); Nafamostat mesilate (Mellgren et al. 
Thromb. Haemost. 75:342-7 (1998)); The growth hormone dependent serine 
protease inhibitor, Spi 2.1 (Maake et al. Endocrinology 735:5630-6 (1997)); 
Growth factor activator inhibitor type 2, a Kunitz-type serine protease inhibitor 

30 (Kawaguchi etaL J. Biol. Chem., 272:27558-64 (1997)); Heat-stable serine 

protease inhibitor protein from ovaries of the desert locust, Schistocerga gregaria 
(Hamdaoui et aL Biochem. Biophys, Res. Commun. 235:357-60 (1997)); Human 
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placental Hepatocyte growth -factor activator inhibitor, a Kunitz-type serine 
protease inhibitor (Shimomura et al. J. Biol. Chem. 272:6370-6 (1997)); FUT- 
187, oral serine protease inhibitor (Shiozaki et al. Gan To Kaguku Ryoho, 23(14): 
1971-9 (1996)); Extracellular matrix-associated serine protease inhibitors {Mr 
5 33,000, 31,000, and 27,000 (Rao et al. (1996) Arch. Btochem. Btophys., 

335:82-92); An irreversible isocoumarin serine protease inhibitor (Palencia, D.D., 
et al., BioL Reprod., 55:536-42 (1996)); 4-(2-aminoethyl)-ben2enesulfonYl 
fluoride (AEBSF) (Nakabo et al, J. Leukoc. Biol. 50:328-36 (1996)); Neuroserpin 
(Osterwalder, T,, et al., EMBO J. 75:2944-53 (1996)); Hunnan serine protease 

10 Inhibitor alpha- 1 -antitrypsin (Forney et al. J. ParasitoL. 52:496-502 (1996)); Rat 
serine protease inhibitor 2.3 (Simar-Blanchet, A.E., et al., Eur. J. Biochem., 
235:638-48 (1996)); Gebaxate mesilate (Parodi, P., et al., J. Cardiothorac. 
Vase. Anesth. 70:235-7 (1996)); Recombinant serine protease Inhibitor, CPTl II 
(Stankiewicz. M., et al., {Acta Biochim. Pol., 43^3^:525-9 (1996)); A cysteine- 

15 rich serine protease inhibitor (Guamerin II) (Kim, D.R., et al., J. Enzym. Inhib., 
70:81-91 (1996)); Diisopropylfluorophosphate (Lundqvist, W., ex d\., inflamm. 
Res., 44(12h^yQ-l (1995)); Nexin 1 (Yu, D.W., et al., J. CellScL, 108(Pt 
727:3867-74 (1995)); LEX032 (Scalia, R., et al.. Shock, 4(4J:25^-6 (1995)); 
Protease nexin I (Houenou, L.J., et al., Proc. Natl. Acad. Set. U.S.A., 92{3J:B9B- 

20 9 (1995)); Chymase-directed serine protease inhibitor (Woodard S.L., et al., J. 
Immunol., I S3(f ^):S0^6-2B (1994)); N-alpha-tosyl-L-lysyl-chloromethyl ketone 
(TLCK) (Bourinbaiar, A.S., et al., Cell Immunol. , 7 55f 7):230-6 (1994)); Smpi56 
(Ghendler, Y., et al., Exp. ParasitoL, 78f2}:^2^'3^ (1994)); Schistosoma 
haematobium serine protease inhibitor (Blanton, R.E., et at., Mol. Biochem, 

25 ParasitoL, 63fV:^-^ 1 (1994)); Spi-1 (Warren, W.C., et al„ MoL Cell EndocrinoL. 
98(11:27 '32 (1993)); TAME (Jessop, J.J., et al.. Inflammation, 7 7^5; :6 13-31 
(1993)); Antithrombin III (Kalaria. R.N., et al,. Am. J. PathoL. 743(3):Q86-93 
(1993)); FOY-305 (Ohkoshi, M., et a\.. Anticancer Res., 73{4):963'6 (1993)); 
Camostat mesilate (Senda, S., et al.. Intern. Med., 32^4^:350-4 (1993)); Pigment 

30 epithelium-derived factor (Steele, F.R., et al., Proc. NatL Acad. ScL U.S.A., 
S0W:1 526-30 (1993)); Antistasin (Holstein, T.W., et al., FEBS Lett., 
309(3}:2SB-92 (1992)); The vaccinia virus K2L gene encodes a serine protease 
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inhibitor (Zhou, et al.. Virology, 789f2):e7B-Q6 {1992)); Bowman-Birk serine- 
protease inhibitor (Werner, M.H., et a\.,J. MoL Bio/,, 225(3) -.87 3-B9 (1992); 
FUT-175 (Yanamoto, H., et al., Neurosurgery, 30{3J:3BB-63 (1992)); FUT-175; 
(Yanamoto, H., et al.. Neurosurgery, 30(3):3S^'6, discussion 356-7 (1992)); 
5 PAI-I (Yreadwell, B.V., et al., J. Orthop. Res., 9^3^:309-16 (1991)); 3,4- 

Dichloroisocoumarin (Rusbridge, N.M., et al., FEBS Lett., 268(1 h^ZZ-Q (1990)); 
Alpha 1-antichymotrypsin (Lindmark, B.E., et a\.,Am. Rev, Respir. Des., 141(4 
Pt IhBBA-B (1990)); P-toluenesulfonyl-L-arginine methyl ester (TAME) (Scuderi, 
?„J. /mmunol., 143(1h'\^B-13 (1989)); Alpha 1-antichymotrypsin (Abraham, 

10 C.R., et a!.. Ceil, 52^:487-501 (1988)); Contrapsin (Modha, J., et al.. 

Parasitology, 96 (Pt r;:99-109 (1988)); Alpha 2-antiplasmin (Holmes, W.E., et 
al., J. BioL Chem., 252^;: 1659-64 (1987)); 3,4-dtchloroisocoumarin (Harper, 
J.W., et al.. Biochemistry, 24^:1831-41 (1985)); Diisoprophylfluorophosphate 
Diisopropylfluorophosphate (Tsutsui, K., et al., Biochem. Biophys. Res. 

15 Commun.. 123(1 }:21^•^ (1984)); Gabexate mesilate (Hesse, B., et al., 
Pharmacol. Res. Commun., 1€(7i:637-45 (1984)); Phenyl methyl sulfonyl 
fluoride (Dufer, J., et aL, Scand. J. Haematol., 32(V:2B-32 (1984)); Protease 
Inhibitor CI-2 (McPhalen, C.A., et al., J. Mol. Bioi, 168(2J'A4S-7 (1983)); 
Phenylmethylsulfonyl fluoride (Sekar V., et al., Biochem. Biophys. Res. 

20 Commun., 89(2):474'B (1979)); PGE1 (Feinstein, M.D., et al., Prostaglandine , 
14(6|:^07B-93 (1977). 

c. Combinatorial libraries and other libraries 
The source of compounds for the screening assays, can be libraries, 
including, but are not limited to, combinatorial libraries. Methods for 

25 synthesizing combinatorial libraries and characteristics of such combinatorial 
libraries are known in the art (See generally. Combinatorial Libraries: Synthesis, 
Screening and Application Potential {Cortese Ed.) Walter de Gruyter, Inc., 1995; 
Tietze and Lieb, Curr. Opin. Chem. Biol.. 2^3^:363-71 (1998); Lam, Anticancer 
Drug Des.. 12(3i:^4B^67 (1997); Blaney and Martin, Curr. Opin. Chem. Biol., 

30 ;^/;:54-9 (1997); and Schultz and Schultz, Biotechnol. Prog., 12(6J:7 29-43 
(1996)). 
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Methods and strategies for generating diverse libraries, primarilv peptide- 
and nucleotide-based oligomer libraries, have been developed using molecular 
biology methods and/or simultaneous chemical synthesis methodologies {see, 
e.g., Dower et al.. Annu. Rep. Med. Chem., 2ff:271-280 (1991); Fodor et al., 
5 Science, 251:767-773 (1991); Jung et al., Angew. Chem. Ind. Ed. Engi.. 
3/:367-383 (1992); Zuckerman et al., Proc. Natl. Acad. Sol USA, 55:4505- 
4509 (1992); Scott et al.. Science, 245:386-390 (1990); Devlin et al.. Science, 
245:404-406 (1990); Cwiria et al., Proc. Natl. Acad. Sci. USA, 57:6378-6382 
(1990); and Gallop et al., J. Medicinai Chemistry, 57:1233-1251 (1994)). The 

1 0 resulting combinatorial libraries potentially contain millions of compounds that 
can be screened to identify compounds that exhibit a selected activity. 

The libraries fall into roughly three categories: fusion-protein-displayed 
peptide libraries in which random peptides or proteins are presented on the 
surface of phage particles or proteins expressed from plasmids; support-bound 

1 5 synthetic chemical libraries in which individual compounds or mixtures of 

compounds are presented on insoluble matrices, such as resin beads {see^ e.g.. 
Lam et al.. Nature, 554:82-84 (1991)) and cotton supports (see, e.g., Eichler et 
al.. Biochemistry 52:1 1035-1 1041 (1993)); and methods in which the 
compounds are used in solution (see, e.g., Houghten et al.. Nature, 554:84-86 

20 (1991); Houghten et al., BioTechniques, 575:412-421 (1992); and Scott et al., 
Curr. Opin. Biotechnoi., 5:40-48 (1994)). There are numerous examples of 
synthetic peptide and oligonucleotide combinatorial libraries and there are many 
methods for producing libraries that contain non-peptidic small organic mole- 
cules. Such libraries can be based on a basis set of monomers that are 

25 combined to form mixtures of diverse organic molecules or that can be combined 
to form a library based upon a selected pharmacophore monomer. 

Either a random or a deterministic combinatorial library can be screened 
by the presently disclosed and/or claimed screening methods. In either of these 
two libraries, each unit of the library is isolated and/or immobilized on a solid 

30 support. In the deterministic library, one knows a priori a particular unit's 

location on each solid support. In a random library, the location of a particular 
unit is not known a priori although each site still contains a single unique unit. 
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Many methods for preparing libraries are known to those of skill in this art (see, 
e.g., Geysen et al., Proc. Natl. Acad. Set. USA, 7:3998-4002 (1984), 
Houghten et al., Proc. Nat/. Acad. Sci. USA. 5/:51 31-51 35 (1 985)). 
Combinatorial libraries generated by any techniques known to those of skill in 
5 the art are contemplated (see, e.g.. Table 1 of Schultz and Schultz, Biotechnol. 
Prog., 72(6):729'A3 (1996)) for screening; Bartel et aL, Science, 2ff7:141 1- 
1418 (1993); Baumbach eta/. BioPharm. (Canh24~3S (1992); Bock eta/. 
Nature, 555:564-566 (1992); Borman, S., Combinatorial chemists focus on 
small molecules molecular recognition, and automation, Chem. Eng. News, 

10 2(/2J:29 (1996); Boublik, et al., Eukaryotic Virus Display: Engineering the Major 
Surface Glycoproteins of the Autographa California Nuclear Polyhedrosis Virus 
(ACNPV) for the Presentation of Foreign Proteins on the Virus Surface, 
Bio/Techno/ogy. 73:1079-1084 (1995); Brenner, et al.. Encoded Combinatorial 
Chemistry, Proc. Nat/. Acad Sci. U.S.A., 55:5381-5383 (1992); Caflisch, et al., 

15 Computational Combinatorial Chemistry for De Novo LIgand Design: Review and 
Assessment, Perspect. Drug Discovery Des., 3:51-84 (1995); Cheng, et al., 
Sequence-Selective Peptide Binding with a Pe pt i d o- A, B-frans- steroidal Receptor 
Selected from an Encoded Combinatorial Library, J. Am. Chem. Soc, 7 75:1813- 
1814 (1996); Chu, et al.. Affinity Capillary Electrophoresis to Identify the 

20 Peptide in A Peptide Library that Binds Most Tightly to Vancomycin, J. Org. 

Chem., 55:648-652 (1993); Clackson, et al.. Making Antibody Fragments Using 
Phage Display Libraries, Nature, 352:624-628 (1991); Combs, et al., Protein 
Structure-Based Combinatorial Chemistry: Discovery of Non-Peptide Binding 
Elements to Src SH3 Domain, J. Am. Chem. Soc, 7 75:287-288 (1996); Cwtria, 

25 et al.. Peptides On Phage: A Vast Library of Peptides for Identifying Ligands, 

Proc. Nat/. Acad. Sci. U.S.A.. 57:6378-6382 (1990); Ecker, et al.. Combinatorial 
Drug Discovery: Which Method will Produce the Greatest Value, 
Bio/Technology. 73:351-360 (1995); Ellington, et al.. In Vitro Selection of RNA 
Molecules That Bind Specific Ligands. Nature, 346:818-822 (1990); Ellman, 

30 J.A., Variants of Benzodiazephines, J. Aw. Chem. Soc, 7 74:10997 (1992); 

Erickson, et al.. The Proteins; Neurath, H., Hill, R.L., Eds.: Academic: New York, 
1976; pp. 255-257; Felici, et al., J. Mo/. Bio/., 222:301-310 (1991); Fodor, et 
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al., Light-Directed, Spatially Addressable Parallel Chemical Synthesis, Science. 
251-JQ1'713 {1991); Francisco, et al.. Transport and Anchoring of Beta- 
Lactamase to the External Surface of E. Co//,, Proc. Natl. Acad. Sci. U.S.A., 
55:2713-2717 (1992); Georgiou, et al.. Practical Applications of Engineering 
5 Gram-Negative Bacterial Cell Surfaces, TtBTECH, 7 7:6-10 (1993); Geysen, et al., 
Use of peptide synthesis to probe viral antigens for epitopes to a resolution of a 
single amino acid, Proc. Naif. Acad. Sci. U.S.A., S7:3998-4002 {1984); Glaser, 
et al.. Antibody Engineering by Codon-Based Mutagenesis in a Filamentous 
Phage Vector System, J. Immunol.. 745:3903-3913 (1992); Gram, et aL, In 

10 vitro selection and affinity maturation of antibodies from a naive combinatorial 
immunoglobulin library, Proc. Nati Acad. Set.. 55:3576-3580 (1992); Han, et 
al., Liquid-Phase Combinatorial Synthesis, Proc. Nati. Acad. Sci. U.S.A.. 
52:6419-6423 (1995); Hoogenboom, et al., Multi-Subunit Proteins on the 
Surface of Filamentous Phage: Methodologies for Displaying Antibody (Fab) 

15 Heavy and Light Chains, Nucleic Acids Res.. 75:4133-4137 {1991); Houghten, 
et al.. General Method for the Rapid Solid-Phase Synthesis of Large Numbers of 
Peptides: Specificity of Antigen-Antibody Interaction at the Level of Individual 
Amino Acids, Proc. Nat/. Acad. Sci. U.S.A.. 52:5131-5135 (1985); Houghten, 
et al.. The Use of Synthetic Peptide Combinatorial Libraries for the Determination 

20 of Peptide Ligands in Radio-Receptor Assays-Opiod-Peptides, Bioorg. /\4ed. 
Chem. Lett.. 5:405-412 (1993); Houghten, et a!.. Generation and Use of 
Synthetic Peptide Combinatorial Libraries for Basic Research and Drug Discovery, 
Nature. 554:84-86 (1991); Huang, et al.. Discovery of New Ligand Binding 
Pathways in Myoglobin by Random Mutagenesis, Nature Struct. Biol., 7:226-229 

25 (1994); Huse, et al., Generation of a Large Combinatorial Library of the 

Immunoglobulin Repertoire In Phage Lambda, Science. 246:1275-1281 (1989); 
Janda, K.D., New Strategies for the Design of Catalytic Antibodies, Biotechnol. 
Prog.. 5:178-181 (1990); Jung, et al., Multiple Peptide Synthesis Methods and 
Their Applications, Angew. Chem. Int. Ed. Eng/.. 57:367-486 (1992); Kang, et 

30 al,. Linkage of Recognition and Replication Functions By Assembling 

Combinatorial Antibody Fab Libraries Along Phage Surfaces, Proc. Natl. Acad. 
Sci. U.S.A.. 55:4363-4366 (1991a); Kang, et al.. Antibody Redesign by Chain 
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Shuf fling from Random Combinatorial Immunoglobulin Libraries, Proc. Nat/. 
Acad. SU U.S.A., 88'A 1 120-1 1 123 (1991b); Kay, et al.. An M13 Phage Library 
Displaying Random 38-Amino-Acid-Peptides as a Source of Novel Sequences 
with Affinity to Selected Targets Genes, Gene. /2a:59-65 (1993); Lam, et al.. A 
5 new type of synthetic peptide library for identifying ligand-binding activity. 
Nature, 354:82-84 (1991) (published errata appear in Nature, 35ff:434 (1992) 
and Nature, 360:76B (1992); Lebl, et al.. One Bead One Structure Combinatorial 
Libraries, Biopolymers (Pept. Set'.), 37:177-198 (1995); Lerner, et al.. Antibodies 
without Immunization, Science, 255:1313-1314 (1992); Li, et a!., Minimization 

10 of a Polypeptide Hormone, Science, 270:1657-1660 (1995); Light, et al., 

Display of Dimeric Bacterial Alkaline Phosphatase on the Major Coat Protein of 
Filamentous Bacteriophage, Bioorg. Med. Chem. Lett, 3:1073-1079 (1992); 
Little, et al.. Bacterial Surface Presentation of Proteins and Peptides: An 
Alternative to Phage Technology, Trends BiotechnoL, 7/:3-5 (1993); Marks, et 

15 al., By-Passing Immunization. Human Antibodies from V-Gene Libraries 
Displayed on Phage, J. Mol. BioL. 222:581-597 (1991); Matthews, et al., 
Substrate Phage: Selection of Protease Substrates by Monovalent Phage Display, 
Science, 260-A 113-1117 (1993); McCafferty, et al.. Phage Enzymes: Expression 
and Affinity Chromatography of Functional Alkaline Phosphatase on the Surface 

20 of Bacteriophage, Protein Eng., 4:955-961 (1991); Menger, et al.. Phosphatase 
Catalysis Developed Via Combinatorial Organic Chemistry, J. Org. Chem., 
60:6666-6667 (1995); Nicolaou, et al., Angew. Chem. tnt. Ed. EngL. 34:2289- 
2291 (1995); Oldenburg, et al.. Peptide Ligands for A Sugar-Binding Protein 
Isolated from a Random Peptide Library, Proc. Natl. Acad. Set. U.S.A., 53:5393- 

25 5397 (1992); Parmley, et at., Antibody-Selectable Filamentous fd Phage Vectors: 
Affinity Purification of Target Genes, Genes, 73:305-318 (1988); Pinilla, et al., 
Synthetic Peptide Combinatorial Libraries (SPCLS)-ldentification of the Antigenic 
Determinant of Beta-Endorphin Recognized by Monoclonal Antibody-3E7, Gene, 
128:7^^7G (1993); Pinilla, et al., Review of the Utility of Soluble Combinatorial 

30 Libraries, Biopoiymers, 37:221-240 (1995); Prstor, et al.. Expression of Viral 
Hemagglutinin On the Surface of E. Coli., Klin. Wochenschr., 55:110-116 
(1989); Pollack, et al.. Selective Chemical Catalysis by an Antibody, Science, 
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254:1570-1572 (1986); Rigler, et al.. Fluorescence Correlations, Single Molecule 
Detection and Large Number Screening: Applications in Biotechnology, J, 
Biotechno!., 47:177-186 (1995); Sarvetnick, et al., Increasing the Chemical 
Potential of the Germ- Line Antibody Repertoire, Proc. Natl. Acad. Sci. U.S.A., 
5 50:4008-401 1 (1993); Sastry et a/.. Cloning of the Immunological Repertoire in 
Escherichia Cofi for Generation of Monoclonal Catalytic Antibodies: Construction 
of a Heavy Chain Variable Region-Specific cDNA Library, Pfoc. Nati Acad. Sci. 
U.S.A.. 56:5728-5732 (1989); Scott, et al.. Searching for Peptide Ligands with 
an Epitope Library, Science. 245:386-390 (1990); Sears, et al.. Engineering 

10 Enzymes for Bloorganic Synthesis: Peptide Bond Formation, Biotechno/. Prog., 
72:423-433 (1996); Simon, et. al., Peptides: A Modular Approach to Drug 
Discovery, Proc. Nat/. Acad. Sci. U.S.A.. 55:9367-9371 (1992); Still, et al.. 
Discovery of Sequence-Selective Peptide Binding by Synthetic Receptors Using 
Encoded Combinatorial Libraries, Acc. Chem. Res.. 25:155-163 (1996); 

15 Thompson, et al.. Synthesis and Applications of Small Molecule Libraries, Chem. 
Rev., 55:555-600 (1996); Tramontano, et al.. Catalytic Antibodies, Science, 
234:1566-1570 (1986); Wrighton, et al., Small Peptides as Potent Mimetics of 
the Protein Hormone Erythropoietin, Science, 275:458-464 (1996); York, et al.. 
Combinatorial mutagenesis of the reactive site region in plasminogen activator 

20 inhibitor I, J. Bio/. Chem.. 255:8595-8600 (1991 ); Zebedee, et al.. Human 
Combinatorial Antibody Libraries to Hepatitis B Surface Antigen, Proc. Natl. 
Acad. Sci. U.S.A., 55:3175-3179 (1992); Zuckerman, et al.. Identification of 
Highest-Affinity Ligands by Affinity Selection from Equimolar Peptide Mixtures 
Generated by Robotic Synthesis, Proc. Nat/. Acad. Sci. U.S.A., 55:4505-4509 

25 (1992). 

For example, peptides that bind to an MTSP20 polypeptide or a protease 
domain of an SP protein can be identified using phage display libraries, in an 
exemplary embodiment, this method can include (a) contacting phage from a 
phage library with the MTSP20 polypeptide or a protease domain thereof; (b) 
30 isolating phage that bind to the protein; and (c) determining the identity of at 
least one peptide coded by the isolated phage to identify a peptide that binds to 
an MTSP20 polypeptide. 
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H. Modulators of an activity of MTSP20 polypeptidas 

Provided herein are compounds, identified by screening or produced using 
an MTSP20 polypeptide or protease domains thereof in other screening methods, 
that modulate an activity of an MTSP20. These compounds act by directly 
5 interacting with the MTSP20 polypeptide or by altering transcription or 

translation thereof. Such molecules include, but are not limited to, antibodies 
that specifically react with an MTSP20 polypeptide, particularly with a protease 
domain thereof, antisense nucleic acids or double-stranded RNA (dsRNA) such as 
RNAi, that alter expression of an MTSP20 polypeptide, antibodies, peptide 
10 mimetics and other such compounds. 

1 . Antibodies 

Antibodies, including polyclonal and monoclonal antibodies, that 
specifically bind to the MTSP20 polypeptide provided herein, particularly to the 
single chain protease domains thereof or the activated forms of the full-length or 

15 protease domain or the zymogen form, are provided. 

Generally, the antibody is a monoclonal antibody, and typically the 
antibody specifically binds to a protease domain of the MTSP20 polypeptide. In 
particular embodiments, antibodies to each of the single chain and or any of the 
multi-chain forms of the protease domains of MTSP20 are provided. Also 

20 provided are antibodies that specifically bind to any domain of MTSP20 and to 
multi-chain forms thereof. 

The MTSP20 polypeptide and domains, fragments, homologs and 
derivatives thereof can be used as immunogens to generate antibodies that 
specifically bind such Immunogens. Such antibodies include but are not limited 

25 to polyclonal, monoclonal, chimeric, single chain. Fab fragments, and an Fab 
expression library. In a specific embodiment, antibodies to human MTSP20 
polypeptide are produced. In another embodiment, complexes formed from 
fragments of an MTSP20 polypeptide, that contain the serine protease domain 
are used as immunogens for antibody production. 

30 Various procedures known in the art can be used for the production of 

polyclonal antibodies to MTSP20 polypeptide, its domains, derivatives, 
fragments or analogs. For production of the antibody, various host animals can 
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be immunized by injection with the native MTSP20 polypeptide or a synthetic 
version, or a derivative of the foregoing, such as a cross-linked MTSP20 
polypeptide. Such host animals include but are not limited to rabbits, mice, rats, 
etc. Various adjuvants can be used to increase the immunological response, 
5 depending on the host species, and include but are not limited to Freund's 
(complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin^ pluronic polyols, polyanions, peptides, oil 
emulsions, dinitrophenol, and potentially useful human adjuvants such as bacille 
Calmette-Guerin (BCG) and corynebacterium parvum. 

10 For preparation of monoclonal antibodies directed towards an MTSP20 

polypeptide or domains, derivatives, fragments or analogs thereof, any technique 
that provides for the production of antibody molecules by continuous cell lines in 
culture can be used. Such techniques include but are not restricted to the 
hybridoma technique originally developed by Kohler and Milstein {Nature 

15 256:495-497 (1975)1, the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al., immunology Today 4.12 (1983)); and the EBV 
hybridoma technique to produce human monoclonal antibodies (Cole et al,, in 
MonocionaJ Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 
{1985)}. In an additional embodiment, monoclonal antibodies can be produced in 

20 germ-free animals utilizing recent technology (PCT/US90/02545). Human 

antibodies can be used and can be obtained by using human hybridomas (Cote 
et al.. Proc, Natl, Acad. Sci. USA 50:2026-2030 (1983)), or by transforming 
human B cells with EBV virus in vitro (Cole et al., in Monocionai Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)). Techniques developed 

25 for the production of "chimeric antibodies" (Morrison et al., Proc, Nati. Acad. 
Sci. USA 81:6851-6855 (1984); Neuberger et al.. Nature 5;2:604-608 (1984); 
Takeda et al.. Nature 5/4:452-454 (1985)) by splicing the genes from a mouse 
antibody molecule specific for the MTSP20 polypeptide together with genes from 
a human antibody molecule of appropriate biological activity can be used. 

30 MTSP20-encoding nucleic acid molecules or portions thereof can be used 

in DNA immunization protocols to produce antibodies that bind to MTSP20 (see, 
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e.g., U.S. Patent No. 5,795,872 and U.S. Patent No. 5,643,578 and U.S. Patent 
No. 6,337,072). 

Techniques described for the production of single chain antibodies (U.S. 
patent 4,946,778) can be adapted to produce MTSP20 polypeptide-specific 
5 single chain antibodies. An additional embodiment uses the techniques 

described for the construction of Fab expression libraries (Huse et a!.. Science 
245:1275-1281 (1989)} to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity for MTSP20 polypeptide or domains, 
derivatives, or analogs thereof. Non-human antibodies can be "humanized" by 

10 known methods (see, e.g., U.S. Patent No. 5,225,539). 

Antibody fragments that specifically bind to MTSP20 polypeptide or 
epitopes thereof can be generated by techniques known in the art. For example, 
such fragments include but are not limited to: the F(ab')2 fragment, which can be 
produced by pepsin digestion of the antibody molecule; the Fab' fragments that 

15 can be generated by reducing the disulfide bridges of the F(ab')2 fragment, the 
Fab fragments that can be generated by treating the antibody molecule with 
papain and a reducing agent, and Fv fragments. 

In the production of antibodies, screening for the desired antibody can be 
accomplished by techniques known in the art, e.g., ELISA (enzyme-linked 

20 immunosorbent assay). To select antibodies specific for a particular domain of 
the MTSP20 polypeptide one can assay generated hybridomas for a product that 
binds to the fragment of the MTSP20 polypeptide that contains such a domain. 

The foregoing antibodies can be used in methods known in the art 
relating to the localization and/or quantitation of MTSP20 polypeptide proteins, 

25 e.g., for imaging these proteins, measuring levels thereof in appropriate 
physiological samples, in, for example, diagnostic methods. In another 
embodiment, anti-MTSP20 polypeptide antibodies, or fragments thereof, 
containing the binding domain are used as therapeutic agents. 
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2. Peptides, Polypeptides and Peptide Mimetics 
Provided herein are methods for identifying molecules that bind to and 
modulate an activity of MT5P20 polypeptides. Included among molecules that 
bind to MTSPs, such as a single chain protease domain or catalytically active 
5 fragments thereof, are peptides, polypeptides and peptide mimetics, including 
cyclic peptides. Peptide mimetics are molecules or compounds that mimic the 
necessary molecular conformation of a Irgand or polypeptide for specific binding 
to a target molecule such as an MTSP20 polypeptide. In an exemplary 
embodiment, the peptides, polypeptides and peptide mtmetics bind to a protease 

10 domain of an MTSP20 polypeptide. Such peptides and peptide mimetics include 
those of antibodies that specifically bind to an MTSP20 polypeptide and. 
typically, bind to a protease domain of an MTSP20 polypeptide. The peptides, 
polypeptides and peptide mimetics identified by methods provided herein can be 
agonists or antagonists of MTSP20 polypeptides. 

1 5 Such peptides, polypeptides and peptide mimetics are useful for 

diagnosing, treating, preventing, and screening for a disease or disorder 
associated with MTSP20 polypeptide activity in a mammal. In addition, the 
peptides and peptide mimetics are useful for identifying, isolating, and purifying 
molecules or compounds that modulate an activity of an MTSP20 polypeptide, or 

20 specifically bind to an MTSP20 polypeptide, generally a protease domain of an 
MTSP20 polypeptide. Low molecular weight peptides and peptide mimetics can 
have strong binding properties to a target molecule, e.g., an r^TSP20 
polypeptide or a protease domain of an MTSP20 polypeptide. 

Peptides, polypeptides and peptide mimetics that bind to MTSP20 

25 polypeptides as described herein can be administered to mammals, including 
humans, to modulate MTSP20 polypeptide activity. Thus, methods for 
therapeutic treatment and prevention of neoplastic diseases comprise ^ 
administering a peptide, polypeptide or peptide mimetic compound in an amount 
sufficient to modulate such activity are provided. Thus, also provided herein are 

30 methods for treating a subject having such a disease or disorder in which a 

peptide, polypeptide or peptide mimetic compound is administered to the subject 
in a therapeutically effective dose or amount. 



wo 03/004681 



PCT/US02/21208 



-111- 

Composftions containing the peptides, polypeptides or peptide mimetics 
can be administered for prophylactic and/or therapeutic treatments. In 
therapeutic applications, compositions can be administered to a patient already 
suffering from a disease, as described above, in an amount sufficient to cure or 
5 at least partially arrest the symptoms of the disease and its complications. 

Amounts effective for this use will depend on the severity of the disease and the 
weight and general state of the patient and can be empirically determined. 

In prophylactic applications, compositions containing the peptides, 
polypeptides and peptide mimetics are administered to a patient susceptible to or 

10 otherwise at risk of a particular disease. Such an amount is defined to be a 

"prophylactically effective dose." In this use, the precise amounts again depend 
on the patient's state of health and weight. Accordingly, the peptides, 
polypeptides and peptide mimetics that bind to an MTSP20 polypeptide can be 
used to prepare pharmaceutical compositions containing, as an active ingredient, 

15 at least one of the peptides, polypeptides or peptide mimetics in association with 
a pharmaceutical carrier or diluent. The compounds can be administered, for 
example, by oral, pulmonary, parenteral (intramuscular, intraperitoneal, 
intravenous (IV) or subcutaneous injection), inhalation (via a fine powder 
formulation), transdermal, nasal, vaginal, recta), or sublingual routes of 

20 administration and can be formutated In dosage forms appropriate for each route 
of administration (see, e.g., International PCT application Nos. WO 93/25221 
and WO 94/17784; and European Patent Application 613,683). 

Peptides, polypeptides and peptide mimetics that bind to MTSP20 
polypeptides are useful In vitro as unique tools for understanding the biological 

25 role of MTSP20 polypeptides, including the evaluation of the many factors 
thought to influence, and be influenced by, the production of MTSP20 
polypeptide. Such peptides, polypeptides and peptide mimetics are also useful in 
the development of other compounds that bind to and modulate an activity of an 
MTSP20 polypeptide, because such compounds provide important Information 

30 on the relationship between structure and activity that should facilitate such 
development. 
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The peptides, polypeptides and peptide mimetics are also useful as 
competitive binders in assays to screen for new MTSP20 polypeptides or 
MTSP20 polypeptide agonists. In such assay embodiments, the compounds can 
be used without modification or can be modified in a variety of ways; for 
5 example, by labeling, such as covalently or non-covalently joining a moiety 

which directly or indirectly provides a detectable signal. In any of these assays, 
the materials thereto can be labeled either directly or indirectly. Possibilities for 
direct labeling include label groups such as: radiolabels such as '^^1 enzymes 
(U.S. Pat. No. 3,645,090), peroxidase and alkaline phosphatase, and fluorescent 

10 labels (U.S. Pat. No. 3,940,475) capable of monitoring the change in 
fluorescence intensity, wavelength shift, or fluorescence polarization. 
Possibilities for indirect labeling include biotinylation of one constituent followed 
by binding to avidin coupled to one of the above label groups. The compounds 
can also include spacers or linkers in cases where the compounds are to be 

15 attached to a solid support. 

Moreover, based on their ability to bind to an MTSP20 polypeptide, the 
peptides, polypeptides and peptide mimetics can be used as reagents for 
detecting MTSP20 polypeptides in living cells, fixed cells, in biological fluids, in 
tissue homogenates and in purified, natural biological materials. For example, 

20 by labelling such peptides, polypeptides and peptide mimetics, cells having 
MTSP20 polypeptides can be identified. In addition, based on their ability to 
bind an MTSP20 polypeptide, the peptides, polypeptides and peptide mimetics 
can be used in in situ staining, FACS (fluorescence-activated cell sorting). 
Western blotting, ELISA and other analytical protocols. Based on their ability to 

25 bind to an MTSP20 polypeptide, the peptides, polypeptides and peptide mimetics 
can be used in purification of MTSP20 polypeptides or in purifying cells 
expressing the MTSP20 polypeptides, e.g., a polypeptide encoding a protease 
domain of an MTSP20 polypeptide. 

The peptides, polypeptides and peptide mimetics can also be used as 

30 commercial reagents for various medical research and diagnostic uses. The 

activity of the peptides and peptide mimetics can be evaluated either in vitro or 
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in vivo in one of the numerous models described in McDonald (1992) Am, J, of 
Pediatric HemBtotogy /Oncology, 74:8-21. 

3. Peptide, polypeptide and peptide mimetic therapy 
Peptide analogs are commonly used in the pharmaceutical industry as 
5 non-peptide drugs with properties analogous to those of the template peptide. 
These types of non-peptide compounds are termed "peptide mimetics" or 
"peptidomimetics" (Luthman etal, A Textbook of Drug Design and 
Development, 14:386-406, 2nd Ed., Harwood Academic Publishers (1996); 
Joachim Grante (1994) Angew. Chem. Int. Ed. Engl., 55:1699-1720; Fauchere 

10 (1986) J. Adv. Drug Res., 75:29; Veber and Freidinger (1985) TINS, p. 392; and 
Evans eta!. (1987) J, Med. Chem. 30:1229). Peptide mimetics that are 
structurally similar to therapeutically useful peptides can be used to produce an 
equivalent or enhanced therapeutic or prophylactic effect. Preparation of 
peptidomimetics and structures thereof are known to those of skill in this art. 

1 5 Systematic substitution of one or more amino acids of a consensus 

sequence with a D-amino acid of the same type [e.g., D-lysine in place of 
L-lysine) can be used to generate more stable peptides. In addition, constrained 
peptides containing a consensus sequence or a substantially Identical consensus 
sequence variation can be generated by methods known in the art (Rizo et al. 

20 (1992) An. /?ev. Biochem., 67:387, incorporated herein by reference); for 

example, by adding internal cysteine residues capable of forming intramolecular 
disulfide bridges which cyclize the peptide. 

Those skilled in the art appreciate that modifications can be made to the 
peptides and mimetics without deleteriously effecting the biological or functional 

25 activity of the peptide. Further, the skilled artisan would know how to design 
non-peptide structures in three dimensional terms, that mimic the peptides that 
bind to a target molecule, e.g., an MTSP20 polypeptide or, generally, a protease 
domain of MTSP20 polypeptides (see, e.g., Eck and Sprang (1989) J. Biol. 
Chem., 26: 17605-18795). 

30 When used for diagnostic purposes, the peptides and peptide mimetics 

can be labeled with a detectable label and, accordingly, the peptides and peptide 
mimetics without such a label can serve as intermediates in the preparation of 
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labeled peptides and peptide mimetics. Detectable labels can be molecules or 
compounds, which when covalently attached to the peptides and peptide 
mimetics, permit detection of the peptide and peptide mimetics in vivo, for 
example, in a patient to whom the peptide or peptide mimetic has been 
5 administered, or in vitro, e.g., in a sample or cells. Suitable detectable labels are 
well known in the art and include, by way of example, radioisotopes, fluorescent 
labels {e.g., fluorescein), and others. The particular detectable label employed is 
not critical and is selected to be detectable at non-toxic levels. Selection of 
such labels is well within the skill of the art. 

1 0 Covatent attachment of a detectable label to the peptide or peptide 

mimetic is accomplished by conventional methods well known in the art. For 
example, when the ^^^1 radioisotope Is employed as the detectable label, covalent 
attachment of ^^^1 to the peptide or the peptide mimetic can be achieved by 
incorporating the amino acid tyrosine into the peptide or peptide mimetic and 

15 then iodinating the peptide (see. e.g., Weaner et ai 11994) Synthesis and 

Applications of isotopically Labelled Compounds, pp. 137-140). If tyrosine is 
not present in the peptide or peptide mimetic, incorporation of tyrosine to the N 
or C terminus of the peptide or peptide mimetic can be achieved by well known 
chemistry. Likewise, "P can be incorporated onto the peptide or peptide 

20 mimetic as a phosphate moiety through, for example, a hydroxy! group on the 
peptide or peptide mimetic using conventional chemistry. 

Labeling of peptidomimetlcs usually involves covalent attachment of one 
or more labels, directly or through a spacer {e.g., an amide group), to 
non-interfering position(s) on the peptidomimetic that are predicted by 

25 quantitative structure-activity data and/or molecular modeling. Such 

non-interfering positions generally are positions that do not form direct contacts 
with the macromolecules{s) to which the peptidomimetic binds to produce the 
therapeutic effect. Derivatizatlon {e.g.. labeling) of peptidomimetics should not 
substantially interfere with the desired biological or pharmacological activity of 

30 the peptidomimetic. 

Peptides, polypeptides and peptide mimetics that can bind to an MTSP20 
polypeptide or a protease domain of MTSP20 polypeptides and/or modulate an 
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activity thereof, or exhibit an MTSP20 polypeptide activity, can be used for 
treatment of neoplastic disease. The peptides, polypeptides and peptide 
mimetics can be delivered, in vivo or ex vivo, to the cells of a subject in need of 
treatment. Further, peptides which have MTSP20 polypeptide activity can be 
5 delivered, in vivo or ex vivo, to cells which carry mutant or missing alleles 

encoding the MTSP20 polypeptide gene. Any of the techniques described herein 
or known to the skilled artisan can be used for preparation and in vivo or ex vivo 
delivery of such peptides, polypeptides and peptide mimetics that are 
substantially free of other human proteins. For example, the peptides, 

10 polypeptides and peptide mimetics can be readily prepared by expression in a 
microorganism or synthesis in vitro. 

The peptides or peptide mimetics can be introduced into cells, in vivo or 
ex vivo, by microinjection or by use of liposomes, for example. Alternatively, the 
peptides, polypeptides or peptide mimetics can be taken up by cells, in vivo or 

15 ex vivo, actively or by diffusion. In addition, extracellular application of the 

peptide, polypeptide or peptide mimetic can be sufficient to effect treatment of a 
neoplastic disease. Other molecules, such as drugs or organic compounds, that: 
1 ) bind to a MTSP20 polypeptide or protease domain thereof; or 2) have a 
similar function or activity to an MTSP20 polypeptide or protease domain 

20 thereof, can be used in methods for treatment. 
4. Rational drug design 

The goal of rational drug design is to produce structural analogs of 
biologically active polypeptides or peptides of interest or of small molecules or 
peptide mimetics with which they Interact {e.g., agonists and antagonists) in 

25 order to fashion drugs which are, e.g., more active or stable forms thereof; or 
which, for example, enhance or interfere with the function of a polypeptide in 
vivo (e.g., an MTSP20 polypeptide). In one approach, one first determines, the 
three-dimensional structure of a protein of interest (e.g., an MTSP20 polypeptide 
or polypeptide having a protease domain) or, for example, of an MTSP20 

30 polypeptide-Iigand complex, by X-ray crystallography, by computer modeling or 
most typically, by a combination of approaches (see, e.g., Erickson et at. 1990). 
Also, useful information regarding the structure of a polypeptide can be gained 
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by modeling based on the structure of homologous proteins. In addition, 
peptides can be analyzed by an alanine scan. In this technique, an amino acid 
residue is replaced by Ala, and its effect on the peptide's activity is determined. 
Each of the amino acid residues of the peptide is analyzed in this manner to 
5 determine the important regions of the peptide. 

Also, a polypeptide or peptide that binds to an MTSP20 polypeptide or, 
generally, a protease domain of an MTSP20 polypeptide, can be selected by a 
functional assay, and then the crystal structure of this polypeptide or peptide 
can be determined. The polypeptide can be, for example, an antibody specific 

10 for an MTSP20 polypeptide or the protein domain of an MTSP20 polypeptide. 
This approach can yield a pharmacophore upon which subsequent drug design 
can be based. Further, it is possible to bypass the crystallography altogether by 
generating anti-idiotypic polypeptides or peptides, (anti-ids) to a functional, 
pharmacologically active polypeptide or peptide that binds to an MTSP20 

15 polypeptide or protease domain of an MTSP20 polypeptide. As a mirror image 
of a mirror image, the binding site of the anti-ids is expected to be an analog of 
the original target molecule, e.g., an MTSP20 polypeptide or polypeptide having 
an MTSP20 polypeptide. The anti-id could then be used to identify and isolate 
peptides from banks of chemically or biologically produced banks of peptides. 

20 Selected peptides would then act as the pharmacophore. 

Thus, one can design drugs which have, for example, improved activity or 
stability or which act as modulators {e.g., inhibitors, agonists or antagonists) of 
MTSP20 polypeptide activity, and are useful in the methods, particularly the 
methods for diagnosis, treatment, prevention, and screening of a neoplastic 

25 disease. By virtue of the availability of nucleic acid that encodes MTSP20 
polypeptides, sufficient amounts of the MTSP20 polypeptide can be made 
available to perform such analytical studies as X-ray crystallography. In addition, 
the knowledge of the amino acid sequence of an MTSP20 polypeptide or a 
protease domain thereof, e.g., a protease domain encoded by the amino acid 

30 sequence of SEQ ID Nos. 16 and 6, can provide guidance on computer modeling 
techniques in place of, or in addition to. X-ray crystallography. 
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Methods of identifying peptides and peptide mimetics that bind to 
MTSP20 polypeptides 

Peptides having a binding affinity to an MTSP20 polypeptide provided 

herein {e.g., an MTSP20 polypeptide or a polypeptide having a protease domain 

5 of an MTSP20 polypeptide) can be readily identified, for example, by randonn 

peptide diversity generating systems coupled with an affinity enrichment 

process. Specifically, random peptide diversity generating systems include the 

"peptides on plasmids" system (see, e.g., U.S. Patent Nos. 5,270,170 and 

5,338,665); the "peptides on phage" system (see, e.g., U.S. Patent No. 

10 6,121,238 and Cwirla,ef sA (1990) Proc. Nat/. Acad, Sc/'. U.S.A. 

57:6378-6382); the "polysome system;" the "encoded synthetic library (ESL)" 

system; and the "very large scale immobilized polymer synthesis" system (see, 

e.g., U.S. Patent No. 6,121,238; and Dower et aL (1991) An. Rep. Med. Chem, 

26:271-280). 

1 5 For example, using the procedures described above, random peptides can 

generally be designed to have a defined number of amino acid residues in length 
[e.g.. 12). To generate the collection of oligonucleotides encoding the random 
peptides, the codon motif (NNK)x, where N is nucleotide A, C, G, or T 
(equimolar; depending on the methodology employed, other nucleotides can be 

20 employed), K is G or T (equimolar), and x is an integer corresponding to the 

number of amino acids in the peptide {e.g.. 1 2) can be used to specify any one 
of the 32 possible codons resulting from the NNK motif: 1 for each of 12 amino 
acids, 2 for each of 5 amino acids, 3 for each of 3 amino acids, and only one of 
the three stop codons. Thus, the NNK motif encodes all of the amino acids, 

25 encodes only one stop codon, and reduces codon bias. 

The random peptides can be presented, for example, either on the surface 
of a phage particle, as part of a fusion protein containing either the pill or the 
pVIII coat protein of a phage fd derivative (peptides on phage) or as a fusion 
protein with the Lad peptide fusion protein bound to a plasmid (peptides on 

30 plasmids). The phage or plasmids, including the DNA encoding the peptides, can 
be Identified and isolated by an affinity enrichment process using immobilized 
MTSP20 polypeptide having a protease domain. The affinity enrichment 
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process, sometimes called "panning," typically involves multiple rounds of 
incubating the phage, plasmids, or polysomes with the immobilized MTSP20 
polypeptide, collecting the phage, plasmids, or polysomes that bind to the 
MTSP20 polypeptide (along with the accompanying DNA or mRNA), and 
5 producing more of the phage or plasmids (along with the accompanying 
Lacl-peptide fusion protein) collected. 

Characteristics of peptides and peptide mimetics 
Among the peptides, polypeptides and peptide mimetics for therapeutic 
application are those of having molecular weights from about 250 to about 

10 8,000 daltons. If such peptides are oligomerized, dimerized and/or derivatized 
with a hydrophilic polymer [e.g., to increase the affinity and/or activity of the 
compounds}, the molecular weights of such peptides can be substantially greater 
and can range anywhere from about 500 to about 1 20,000 daltons, generally 
from about 8,000 to about 80,000 daltons. Such peptides can contain 9 or 

15 more amino acids that are naturally occurring or synthetic (non-naturally 

occurring) amino acids. One skilled In the art can determine the affinity and 
molecular weight of the peptides and peptide mimetics suitable for therapeutic 
and/or diagnostic purposes (e.^., see Dower eta/., U.S. Patent No. 6,121,238). 
The peptides can be covalently attached to one or more of a variety of 

20 hydrophilic polymers. Suitable hydrophilic polymers include, but are not limited 
to, polyalkylethers as exemplified by polyethylene glycol and polypropylene 
glycol, polylactic acid, polyglycolic acid, polyoxyalkenes, polyvinylalcohol, 
polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and dextran 
derivatives. When the peptide compounds are derivatized with such polymers, 

25 their solubility and circulation half-lives can be increased with little, if any, 

diminishment in their binding activity. The peptide compounds can be dimerized 
and each of the dimeric subunits can be covalently attached to a hydrophilic 
polymer. The peptide compounds can be PEGylated, i.e., covalently attached to 
polyethylene glycol (PEG). 
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5. Methods of preparing peptides and peptide mimetics 
Peptides that bind to MTSP20 polypeptides can be prepared by classical 
methods known in the art, for example, by using standard solid phase 
techniques. The standard methods include exclusive solid phase synthesis, 
5 partial solid phase synthesis methods, fragment condensation, classical solution 
synthesis, and even by recombinant DNA technology (see, e.g., Merrifield 
(1963) J. Am. Chem. Soc, 55:2149, incorporated herein by reference.) 

Using the "encoded synthetic library" or "very large scale immobilized 
polymer synthesis" systems (see, e.g., U.S. Patent No. 5,925,525, and 

10 5,902,723), the minimum size of a peptide with an activity of interest can be 
determined. In addition all peptides that form the group of peptides that differ 
from the desired motif (or the minimum size of that motif) in one, two, or more 
residues can be prepared. This collection of peptides then can be screened for 
the ability to bind to the target molecule, e.g., MTSP20 polypeptide or, 

15 generally, a protease domain of an MTSP20 polypeptide. This immobilized 

polymer synthesis system or other peptide synthesis methods can also be used 
to synthesize truncation analogs and deletion analogs and combinations of 
truncation and deletion analogs of the peptide compounds. 

These procedures can also be used to synthesize peptides in which amino 

20 acids other than the 20 naturally occurring, genetically encoded amino acids are 
substituted at one, two, or more positions of the peptide. For instance, 
naphthylalanine can be substituted for tryptophan, facilitating synthesis. Other 
synthetic amino acids that can be substituted into the peptides include 
L-hydroxypropyl, L-3, 4-dihydroxy-phenylalanyl, d amino acids such as 

25 L-d-hydroxylysyl and D-d-methylalanyl, L-o-methylalanyl, /? amino acids, and 
isoquinotyl. D amino acids and non-naturally occurring synthetic amino acids 
can also be incorporated into the peptides (see, e.g., Roberts era/. (1983) 
Unusual Amino/Acids in Peptide Synthesis, 5(6):341 -449). 

The peptides also can be modified by phosphorylation (see, e.g., W. 

30 Bannwarth et af. (1 996) Bioorganic and Medicinal Chemistry Letters, 

6(17):2141-2146), and other methods for making peptide derivatives (see, e.g., 
Hruby et al. (1990) Biochem. J., 26S{2):249-262). Thus, peptide compounds 
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atso serve as a basis to prepare peptide mimetics with similar or improved 
biological activity. 

Those of skill in the art recognize that a variety of techniques are 
available for constructing peptide mimetics with the same or similar desired 
5 biological activity as the corresponding peptide compound but with more 
favorable activity than the peptide with respect to solubility, stability, and 
susceptibility to hydrolysis and proteolysis {see, e.g., Morgan eta/. {^9S9) An. 
Hep. Med. Chem., 24:243-252), Methods for preparing peptide mimetics 
modified at the N-terminat amino group, the C-terminal carboxy) group, and/or 

10 changing one or more of the amido linkages in the peptide to a non-amido 
linkage are known to those of skill in the art. 

Amino terminus modifications include, but are not limited to, alkylating, 
acetylating and adding a carbobenzoyi group, forming a succinimide group (see. 
e.g., Murray et a/. (1995) Burger's Medicinal Chemistry and Drug Discovery, 5th 

15 erf.. Vol. 1, Manfred E. Wolf, ed., John Witey and Sons, Inc.K C-terminal 

modifications include mimetics wherein the C-terminal carboxyl group is replaced 
by an ester, an amide or modifications to form a cyclic peptide. 

In addition to N-terminal and C-terminal modifications, the peptide 
compounds, including peptide mimetics, can advantageously be modified with or 

20 covalently coupled to one or more of a variety of hydrophilic polymers. It has 
been found that when peptide compounds are derivatized with a hydrophilic 
polymer, their solubility and circulation half-lives can be increased and their 
immunogenicity is masked, with little, if any, diminishment in their binding 
activity. Suitable nonproteinaceous polymers include, but are not limited to, 

25 polyalkylethers as exemplified by polyethylene glycol and polypropylene glycol, 
polylactic acid, polyglycotic acid, polyoxyalkenes, polyvinylalcohol, 
polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and dextran 
derivatives. Generally, such hydrophilic polymers have an average molecular 
weight ranging from about 500 to about 100,000 daltons. Including from about 

30 2,000 to about 40,000 daltons and, from about 5,000 to about 20,000 daltons. 
The hydrophilic polymers also can have an average molecular weights of about 
5,000 daltons, 10,000 daltons and 20,000 daltons. 
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Methods for derivatizing peptide compounds or for coupling peptides to 
such polynners have been described (see, e.g., Zailipsky (1995) Bioconjugate 
Chem., 5:150-165; Monfardini era/. (1995) Bioconjugate Chem., 5:62-69; U.S. 
Pat. No. 4,640,835; U.S. Pat. No. 4,496,689; U.S. Pat. No. 4,301,144; U.S. 
5 Pat. No. 4,670,417; U.S. Pat. No. 4,791,192; U.S. Pat. No. 4,179,337 and WO 
95/34326, all of which are incorporated by reference in their entirety herein). 

Other methods for making peptide derivatives are described, for example, 
in Hruby et al. (1990), Biochem J., 255{2):249-262, which is incorporated 
herein by reference. Thus, the peptide compounds also serve as structural 

10 models for non-peptidic compounds with similar biological activity. Those of 
skill in the art recognize that a variety of techniques are available for 
constructing compounds with the same or similar desired biological activity as a 
particular peptide compound but with more favorable activity with respect to 
solubility, stability, and susceptibility to hydrolysis and proteolysis (see, e.flf., 

15 Morgan et ai. (1989) /in. Rep, Med. Chem., 24:243-252, incorporated herein by 
reference). These techniques include replacing the peptide backbone with a 
backbone composed of phosphonates, amidates, carbamates, sulfonamides, 
secondary amines, and N-methylamino acids. 

Peptide compounds can exist in a cyclized form with an intramolecular 

20 disulfide bond between the thiol groups of the cysteines. Alternatively, an 

intermolecular disulfide bond between the thiol groups of the cysteines can be 
produced to yield a dimeric (or higher oligomeric) compound. One or more of the 
cysteine residues can also be substituted with a homocysteine. 
I. Conjugates 

25 A conjugate, containing: a) a single chain protease domain (or 

proteolytically active portion thereof) or two or a plurality of protease domains of 
an MTSP20 polypeptide or a full length zymogen, activated form thereof, or two 
or single chain protease domain thereof; and b) a targeting agent linked to the 
MTSP20 polypeptide directly or via a linker, wherein the agent facilitates: i) 

30 affinity isolation or purification of the conjugate; ii) attachment of the conjugate 
to a surface; iii) detection of the conjugate; or iv) targeted delivery to a selected 
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tissue or cell, is provided herein. The conjugate can be a chemical conjugate or a 
fusion protein mixture thereof. 

The targeting agent can be a protein or peptide fragment, such as a 
tissue specific or tumor specific monoclonal antibody or growth factor or 
5 fragment thereof linked either directly or via a linker to an MTSP20 polypeptide 
or a protease domain thereof. The targeting agent can also be a protein or 
peptide fragment that contains a protein binding sequence, a nucleic acid binding 
sequence, a lipid binding sequence, a polysaccharide binding sequence, or a 
metal binding sequence, or a linker for attachment to a solid support. In a 

10 particular embodiment, the conjugate contains a) the MTSP20 or portion thereof, 
as described herein; and b) a targeting agent linked to the MTSP20 polypeptide 
directly or via a linker. 

Conjugates, such as fusion proteins and chemical conjugates, of the 
MTSP20 polypeptide with a protein or peptide fragment (or plurality thereof) that 

1 5 functions, for example, to facilitate affinity isolation or purification of the 

MTSP20 polypeptide domain, attachment of the MTSP20 polypeptide domain to 
a surface, or detection of the MTSP20 polypeptide domain are provided. The 
conjugates can be produced by chemical conjugation, such as via thiol linkages, 
and can be produced by recombinant means as fusion proteins. In the fusion 

20 protein, the peptide or fragment thereof is linked to either the N-terminus or C- 
terminus of the MTSP20 polypeptide domain. In chemical conjugates the 
peptide or fragment thereof can be linked anywhere that conjugation can be 
effected, and there can be a plurality of such peptides or fragments linked to a 
single MTSP20 polypeptide domain or to a plurality thereof. 

25 The targeting agent is for in vitro or in vivo delivery to a cell or tissue, 

and includes agents such as cell or tissue-specific antibodies, growth factors and 
other factors (including compounds) that bind to moieties expressed on specific 
cells; and other cell or tissue specific agents that promote directed delivery of a 
linked protein. The targeting agent can be one that specifically delivers the 
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MTSP20 polypeptide to selected cells by interaction with a cell surface protein 
and internalization of conjugate or MTSP20 polypeptide portion thereof. 

These conjugates are used in a variety of methods and are particularly 
suited for use in nnethods of activation of prodrugs, such as prodrugs that upon 
5 cleavage by the particular MTSP20, which is localized at or near the targeted cell 
or tissue, are cytotoxic. The prodrugs are administered prior to, or 
sinnultaneously with, or subsequently to the conjugate. Upon delivery to the 
targeted cells, the protease activates the prodrug, which then exhibits a 
therapeutic effect, such as a cytotoxic effect. 

10 1 . Conjugation 

Conjugates with linked IVITSP20 polypeptide domains can be prepared 
either by chemical conjugation, recombinant DNA technology, or combinations of 
recombinant expression and chemical conjugation. The MTSP20 polypeptide 
domains and the targeting agent can be linked in any orientation and more than 

1 5 one targeting agent and/or MTSP20 polypeptide domain can be present in a 
conjugate. 

a. Fusion proteins 

Fusion proteins are provided herein. A fusion protein contains: a) one or 
a plurality of domains of an MTSP20 polypeptide and b) a targeting agent. The 
20 fusion proteins are generally produced by recombinant expression of nucleic 
acids that encode the fusion protein. 

b. Chemical conjugation 

To effect chemical conjugation herein, the MTSP20 polypeptide domain is 
linked via one or more selected linkers or directly to the targeting agent. 
25 Chemical conjugation must be used if the targeted agent is other than a peptide 
or protein, such as a nucleic acid or a non-peptide drug. Any means known to 
those of skill in the art for chemically conjugating selected moieties can be used. 

2. Linkers 

Linkers for two purposes are contemplated herein. The conjugates can 
30 include one or more tinkers between the MTSP20 polypeptide portion and the 
targeting agent. Additionally, linkers are used for facilitating or enhancing 
immobilization of an MTSP20 polypeptide or portion thereof on a solid support. 
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such as a microtiter plate, silicon or silicon-coated chip, glass or plastic support, 
such as for high throughput solid phase screening protocols. 

Any Ymker known to those of skill in the art for preparation of conjugates 
can be used herein. These linkers are typically used in the preparation of 
5 chemical conjugates; peptide linkers can be incorporated into fusion proteins. 

Linkers can be any moiety suitable to associate a domain of MTSP20 
polypeptide and a targeting agent. Such linkers and linkages include, but are not 
limited to, amino acid and peptide linkages, typically containing between one and 
about 60 amino acids, more generally between about 10 and 30 amino acids, 

10 chemical linkers, such as heterobifunctional cleavabte cross-linkers, including but 
are not limited to, N-succinimidyl (4-iodoacetyl)-aminobenzoate, 
sulfosuccinimydil (4-iodoacetyl)-aminobenzoate, 4-succinimldyl-oxycarbonyl-a- 
(2-pyridyldithio)toluene, sulfosuccinimidyl-6- [a-methyl-a-(pyridyldithiol)- 
toluamido] hexanoate, N-succinimidyl-3-(-2-pyridyldithio) - proprionate, 

15 succinimidyl 6[3(-(-2-pyridvldithio)-proprionamido] hexanoate, sulfosuccinimidyl 
6[3{-{-2-pyridyldithio)-propionamidol hexanoate, 3-(2-pyridyldithio|-propionyl 
hydrazide, Ellman's reagent, dichlorotriazinic acid, and S-(2-thiopyridyl)-L- 
cysteine. Other linkers include, but are not limited to peptides and other 
moieties that reduce steric hindrance between the domain of MTSP20 

20 polypeptide and the targeting agent, intracellular enzyme substrates, linkers that 
increase the flexibility of the conjugate, tinkers that increase the solubility of the 
conjugate, linkers that increase the serum stability of the conjugate, 
photocleavable linkers and acid cleavable linkers. 

Other exemplary linkers and linkages that are suitable for chemically 

25 linked conjugates include, but are not limited to, disulfide bonds, thioether 
bonds, hindered disulfide bonds, and covalent bonds between free reactive 
groups, such as amine and thiol groups. These bonds are produced using 
heterobifunctional reagents to produce reactive thiol groups on one or both of 
the polypeptides and then reacting the thiol groups on one polypeptide with 

30 reactive thiol groups or amine groups to which reactive maleimido groups or thiol 
groups can be attached on the other. Other linkers include, acid cleavable 
linkers, such as bismaleimideothoxy propane, acid labile-transferrin conjugates 
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and adipic acid dihydrazide, that would be cleaved in more acidic Intracellutar 
compartments; cross linkers that are cleaved upon exposure to UV or visible 
light; and linkers, such as various domains, such as ChI, Ch2, and Ch3, from the 
constant region of human IgG, (see, Batra et ai Molecular Immunol., 30:379- 
5 386 (1993)). In some embodiments, several linkers can be included in order to 
take advantage of desired properties of each linker. 

Chemical linkers and peptide linkers can be inserted by covalently 
coupling the linker to the domain of MTSP20 polypeptide and the targeting 
agent. The heterobifunctional agents, described below, can be used to effect 

10 such covalent coupling. Peptide linkers can also be linked by expressing DNA 
encoding the linker and therapeutic agent (TA), linker and targeted agent, or 
linker, targeted agent and therapeutic agent (TA) as a fusion protein. Flexible 
linkers and linkers that increase solubility of the conjugates are contemplated for 
use, either alone or with other linkers are also contemplated herein. 

15 a. Acid 'cleavable, photocleavable and heat sensitive linkers 

Acid cleavable linkers, photocleavable and heat sensitive linkers can also 
be used, particularly where it can be necessary to cleave the domain of MTSP20 
polypeptide to permit it to be more readily accessible to reaction. Acid cleavable 
linkers include, but are not limited to, bismaletmideothoxy propane; and adipic 

20 acid dihydrazide linkers (see, e.g.. Fattom et al. (1992) Infection & Immun. 
50:584-589) and acid labile transferrin conjugates that contain a sufficient 
portion of transferrin to permit entry into the intracellular transferrin cycling 
pathway (see, e.g., Welhoner et al. (1991) J. B/ol. Chem. 255:4309-4314). 

Photocleavable linkers are linkers that are cleaved upon exposure to light 

25 (see, e.g., Goldmacher et al. (1992) Bioconj. Chem. 5:104-107, which linkers 
are herein incorporated by reference), thereby releasing the targeted agent upon 
exposure to light. Photocleavable linkers that are cleaved upon exposure to light 
are known (see, e.g., Hazum et al. (1981) in Pept., Proc. Eur. Pept. Symp., 
16th, Brunfeldt, K (Ed), pp. 105-1 10, which describes the use of a nitrobenzyl 

30 group as a photocleavable protective group for cysteine; Yen et al. (1989) 
Makromol. Chem 750:69-82, which describes water soluble photocleavable 
copolymers, including hydroxypropylmethacrylamide copolymer, glycine 
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copolymer, fluorescein copolymer and nnethylrhodamine copolymer; Gold- 
macher era/. (1992) BioconJ. Chem. 3:104-107, which describes a cross-linker 
and reagent that undergoes photolytic degradation upon exposure to near UV 
light (350 nm); and Senter et ah (1985) Photochem. Photobiol 42:231-237, 
5 which describes nitrobenzyloxycarbonyl chloride cross linking reagents that 
produce photocieavabte linkages), thereby releasing the targeted agent upon 
exposure to light. Such linkers would have particular use in treating 
dermatologicai or ophthalmic conditions that can be exposed to light using fiber 
optics. After administration of the conjugate, the eye or skin or other body part' 
10 can be exposed to light, resulting in release of the targeted moiety from the 

conjugate. Such photocteavable linkers are useful in connection with diagnostic 
protocols in which it is desirable to remove the targeting agent to permit rapid 
clearance from the body of the animal. 

b. Other linkers for chemical conjugation 

1 5 Other linkers, include trityl linkers, particularly, dertvatized 

trityt groups to generate a genus of conjugates that provide for 
release of therapeutic agents at various degrees of acidity or alkalinity. 
The flexibility thus afforded by the ability to preselect the pH range at 
which the therapeutic agent is released allows selection of a linker based on the 

20 known physiological differences between tissues in need of delivery of a 
therapeutic agent {see, e,g., U.S. Patent No, 5,612,474). For example, the 
acidity of tumor tissues appears to be lower than that of normal tissues. 

c. Peptide linkers 

The linker moieties can be peptides. Peptide tinkers can be employed in 
25 fusion proteins and also in chemically linked conjugates. The peptide typically 
has from about 2 to about 60 amino acid residues, for example from about 5 to 
about 40, or from about 10 to about 30 amino acid residues. The length 
selected depends upon factors, such as the use for which the linker is included. 
Peptide linkers are advantageous when the targeting agent is 
30 proteinaceous. For example, the linker moiety can be a flexible spacer amino 
acid sequence, such as those known in single-chain antibody research. 
Examples of such known linker moieties include, but are not limited to. 
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peptides, such as (Gly^Ser)„ and {Ser^Gly|„, in which n is 1 to 6, including 1 to 
4 and 2 to 4, and m is 1 to 6, including 1 to 4, and 2 to 4, enzyme cleavable 
linkers and others. 

Additional linking moieties are described, for example, in Huston era/., 
5 Proc. Nat!. Acad, ScL U.S.A. 55:5879-5883, 1988; Whitlow, M., eiaL, Protein 
Engineering 6:989-995, 1993; Newton etal.. Biochemistry 35:545-553, 1996; 
A. J. Cumber et ai, Bioconj. Chem. 3:397-401, 1992; Ladurner et ai, J, Moi. 
Bio!. 273:330-337, 1997; and U.S. Patent. No. 4,894,443. In some 
embodiments, several linkers can be included in order to take advantage of 
1 0 desired properties of each linker. 

3. Targeting agents 

Any agent that facilitates detection, immobilization, or purification of the 
conjugate is contemplated for use herein. For chemical conjugates any moiety 
that has such properties is contemplated; for fusion proteins, the targeting agent 

15 is a protein, peptide or fragment thereof that is sufficient to effect the targeting 
activity. Contemplated targeting agents Include those that deliver the MTSP20 
polypeptide or portion thereof to selected cells and tissues. Such agents include 
tumor specific monoclonal antibodies and portions thereof, growth factors, such 
as FGF, EGF, PDGF, VEGF, cytokines, including chemokines, and other such 

20 agents. 

4. Nucleic acids, plasmids and cells 

Isolated nucleic acid fragments encoding fusion proteins are provided. 
The nucleic acid fragment that encodes the fusion protein includes: a) nucleic 
acid encoding a protease domain of an MTSP20 polypeptide; and b) nucleic acid 
25 encoding a protein, peptide or effective fragment thereof that facilitates: i) 

affinity isolation or purification of the fusion protein; ii) attachment of the fusion 
protein to a surface; or iii) detection of the fusion protein. Generally, the nucleic 
acid is DNA. 

Plasmids for replication and vectors for expression that contain the above 
30 nucleic acid fragments are also provided. Cells containing the plasmids and 

vectors are also provided. The cells can be any suitable host including, but are 
not limited to, bacterial cells, yeast cells, fungal cells, plant cells, insect cell and 
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animal cells. The nucleic acids, plasmids, and cells containing the plasmids can 
be prepared according to methods known in the art including any described 
herein. 

Also provided are methods for producing the above fusion proteins. An 
5 exemplary method includes the steps of growing, for example, culturing the cells 
so that they proliferate, cells containing a plasmid encoding the fusion protein 
under conditions whereby the fusion protein is expressed by the cell, and 
recovering the expressed fusion protein. Methods for expressing and recovering 
recombinant proteins are well known in the art (See generally. Current Protocols 

10 in Molecular Biology (1998) § 16, John Wiley & Sons, Inc.) and such methods 
can be used for expressing and recovering the expressed fusion proteins. 

The recovered fusion proteins can be isolated or purified by methods 
known in the art such as centrifugation, filtration, chromatography, 
electrophoresis, immunoprecipitation, and other such methods, or by a 

1 5 combination thereof (See generally. Current Protocols in Molecular Biology 

(1998) % 10, John Wiley & Sons, Inc.). Generally the recovered fusion protein is 
isolated or purified through affinity binding between the protein or peptide 
fragment of the fusion protein and an affinity binding moiety. As discussed in 
the above sections regarding the construction of the fusion proteins, any affinity 

20 binding pairs can be constructed and used in the Isolation or purification of the 
fusion proteins. For example, the affinity binding pairs can be protein binding 
sequences/protein, DNA binding sequences/ONA sequences, RNA binding 
sequences/RNA sequences, lipid binding sequences/lipid, polysaccharide binding 
sequences/polysaccharide, or metal binding sequences/metal. 

25 B. Immobilization and supports or substrates therefor 

In certain embodiments, where the targeting agents are designed for 
linkage to surfaces, the MTSP20 polypeptide can be attached by linkage such as 
ionic or covalent, non-covalent or other chemical interaction, to a surface of a 
support or matrix material. Immobilization can be effected directly or via a 

30 linker. The MTSP20 polypeptide can be immobilized on any suitable support, 
including, but not limited to, silicon chips, and other supports described herein 
and known to those of skill in the art. A plurality of MTSP20 polypeptide or 
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protease domains thereof can be attached to a support, such as an array He., a 
pattern of two or more) of conjugates on the surface of a silicon chip or other 
chip for use in high throughput protocols and formats. 

It is also noted that the domains of the IVITSP20 polypeptide can be 
5 linked directly to the surface or via a linker without a targeting agent linked 
thereto. Hence chips containing arrays of the domains of the MTSP20 
polypeptide are also provided. 

The matrix material or solid supports contemplated herein are generally 
any of the insoluble materials known to those of skill in the art to immobilize 

10 ligands and other molecules, and are those that are used in many chemical 
syntheses and separations. Such supports are used, for example, in affinity 
chromatography, in the immobilization of biologically active materials, and during 
chemical syntheses of biomolecules, including proteins, amino acids and other 
organic molecules and polymers. The preparation of and use of supports is well 

1 5 known to those of skill in this art; there are many such materials and 

preparations thereof known. For example, naturally-occurring support materials, 
such as agarose and cellulose, can be isolated from their respective sources, and 
processed according to known protocols, and synthetic materials can be 
prepared in accord with known protocols. 

20 The supports are typically insoluble materials that are solid, porous. 

deformable, or hard, and have any required structure and geometry, including, 
but not limited to: beads, pellets, disks, capillaries, hollow fibers, needles, solid 
fibers, random shapes, thin films and membranes. Thus, the item can be 
fabricated from the matrix material or combined with it, such as by coating all or 

25 part of the surface or impregnating particles. 

Typically, when the matrix is particulate, the particles are at least about 
10-2000 fjm, but can be smaller or larger, depending upon the selected 
application. Selection of the matrices is governed, at least in part, by their 
physical and chemical properties, such as solubility, functional groups, 

30 mechanical stability, surface area swelling propensity, hydrophobic or hydrophilic 
properties and intended use. 
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If necessary, the support matrix material can be treated to contain an 
appropriate reactive moiety. In some cases, the support matrix material already 
containing the reactive moiety can be obtained commercially. The support 
matrix material containing the reactive moiety can thereby serve as the matrix 
5 support upon which molecules are linked. Materials containing reactive surface 
moieties such as amino silane linkages, hydroxy! linkages or carboxysilane 
linkages can be produced by well established surface chemistry techniques 
involving silanization reactions, or the like. Examples of these materials are 
those having surface silicon oxide moieties, covalently linked to gamma-amino- 

10 propylsilane, and other organic moieties; N-t3-{trlethyoxysilyOpropyl]phthelamic 
acid; and bis-(2-hydroxyethyl)aminopropyltriethoxysilane. Exemplary readily 
available materials containing amino group reactive functionalities, include, but 
are not limited to, para-aminophenyltriethyoxysilane. Also derivatized 
polystyrenes and other such polymers are weW known and readily available to 

15 those of skill in this art {e.g., the Tentagel® Resins are available with a multitude 
of functional groups, and are sold by Rapp Polymere, Tubingen, Germany; see, 
U.S. Patent No. 4,908,405 and U.S. Patent No. 5,292,814; see, a/so Butz et aL, 
Peptide f^es., 7:20-23 (1994); and Kleine et al., immunobiol., /SO:53-66 
(1994)). 

20 These matrix materials include any material that can act as a support 

matrix for attachment of the molecules of interest. Such materials are known to 
those of skill in this art, and include those that are used as a support matrix. 
These materials include, but are not limited to, inorganics, natural polymers, and 
synthetic polymers, including, but not limited to: cellulose, cellulose derivatives, 

25 acrylic resins, glass, silica gels, polystyrene, gelatin, polyvinyl pyrrolidone, co- 
polymers of vinyl and acrylamide, polystyrene cross-linked with divinylbenzene 
and others {see, Merrifield. Biochemistry. 5:1385-1390 (1964)), 
polyacrylamides, latex gels, polystyrene, dextran, polyacrylamides, rubber, 
silicon, plastics, nitrocellulose, celluloses, natural sponges. Of particular interest 

30 herein, are highly porous glasses (see, e.g., U.S. Patent No. 4,244,721) and 
others prepared by mixing a borosilicate, alcohol and water. 
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Synthetic supports include, but are not limited to: acrylamides, dextran- 
derivatives and dextran co-polymers, agarose-poiyacrylamlde blends, other 
polymers and co-polymers with various functional groups, methacrylate 
derivatives and co-potymers, polystyrene and polystyrene copolymers {see, e.g., 
5 Merrifield, Biochemistry^ 3:1385-1390 (1964); Berg et al., in innovation 

Perspect. Solid Phase Synth. Collect. Pap., Int. Symp., 1st, Epton, Roger (Ed), 
pp. 453-459 (1990); Berg et al., Pept., Proc. Eur. Pept. Symp., 20th, Jung, G. 
etal. (Eds), pp. 196-198 (1989); Berg et al., J. Am. Chem. Soc, 
/; /:8024-8026 (1989); Kent et al., Isr. J. Chem.. / 7:243-247 (1979); Kent et 

10 al., J. Org. Chem., 43:2845-2852 (1978); Mitchell et al.. Tetrahedron Lett., 

42:3795-3798 (1976); U.S. Patent No. 4,507,230; U.S. Patent No. 4,006,1 17; 
and U.S. Patent No. 5,389,449). Such materials include those made from 
polymers and co-polymers such as polyvinylalcohots, acrylates and acrylic acids 
such as polyethylene-co-acrylic acid, polyethylene-co-methacrylic acid, polyethy- 

15 lene-co-ethylacrylate, polyethylene-co-methyl acrylate, polypropylene-co-acrylic 
acid, poiypropylene-co-methyi-acrylic acid, polypropylene-co-ethylacrylate, 
polypropylene-co-methyl acrylate, polyethylene-co-vinyl acetate, poly- 
propylene-co-vinyl acetate, and those containing acid anhydride groups such as 
polyethylene-co-maleic anhydride and polypropylene-co-maleic anhydride. 

20 Liposomes have also been used as solid supports for affinity purifications (Powell 
et al, Biotechnoi Bioeng., 33:M3 (1989)). 

Numerous methods have been developed for the immobilization of 
proteins and other bromolecules onto solid or liquid supports {see, e.g., 
Mosbach, iVIethods in Enzymoiogy, 44 (1976); Weetall, immobilized Enzymes, 

25 An tigens, A n tibodies, and Pep tides , (1975); Kennedy eta!., Solid Phase 

Biochemistry, Analytical and Synthetic Aspects, Scouten, ed., pp. 253-391 
(1983); see, generally, Affinity Techniques. Enzyme Purification: Part B. 
Methods in Enzymoiogy, Vol. 34, ed. W. B. Jakoby, M. Wilchek, Acad. Press, 
N.Y. (1974); and Immobilized Biochemicals and Affinity Chromatography, 

30 Advances In Experimental Medicine ar)d Biology, vol. 42, ed. R. Dunlap, Plenum 
Press, N.Y. (1974)). 
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Among the most commonly used methods are absorption and adsorption 
or covalent binding to the support, either directly or via a tinker, such as the 
numerous disulfide linkages, thioether bonds, hindered disulfide bonds, and ^ 
covalent bonds between free reactive groups, such as amine and thiol groups, 
5 known to those of skill in art (see, e.g.. the PIERCE CATALOG, 

ImmunoTechnology Catalog & Handbook, 1992-1993, which describes the 
preparation of and use of such reagents and provides a commercial source for 
such reagents; Wong, Chemistry of Protein Conjugation and Cross Unking. CRC 
Press (1993); see also DeWitt et al., Proc. Natl. Acad. Sci. U,$,A.. 50:6909 

10 (1993); Zuckermann et al., J. Am. Chem. Sac, //4: 10646 (1992); Kurth et al., 
J. Am. Chem. $oc.. 7 76:2661 (1994); Ellman et a!., Proc. Natl. Acad. Sci. 
U.S.A., 91:^708 (1994); Sucholeiki, Tetrahedron Lttrs., 35:1Z01 (1994); Su- 
Sun Wang, J. Org. Chem., 47:3258 (1976); Padwa et al., J. Org. Chem., 
47:3550 (1971); and Vedejs et al., J. Org. Chem.. 45:575 (1984), which 

15 describe photosensitive linkers). 

To effect immobilization, a composition containing the protein or other 
biomolecule is contacted with a support material such as alumina, carbon, an 
ion-exchange resin, cellulose, glass or a ceramic. Fluorocarbon polymers have 
been used as supports to which biomolecules have been attached by adsorption 

20 [see. U.S. Patent No. 3,843,443; Published International PCT Application 
WO/86 03840). 
J. Prognosis and diagnosis 

MTSP20 polypeptide proteins, domains, analogs, and derivatives thereof, 
and encoding nucleic acids (and sequences complementary thereto), and anti- 

25 MTSP20 polypeptide antibodies, can be used in diagnostics, particularly 

diagnosis of lung, head and neck, such as esophageal tumors, prostate, colon, 
ovary, cervix, breast and pancreas cancers. Such molecules can be used in 
assays, such as immunoassays, to detect, prognose, diagnose, or monitor 
various conditions, diseases, and disorders affecting IVITSP20 polypeptide 

30 expression, or monitor the treatment thereof. For purposes herein, the presence 
of MTSP20S in body fluids or tumor tissues are of particular interest. 
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In particular, such an immunoassay is carried out by a method including 
contacting a sample derived from a patient with an anti-MTSP20 polypeptide 
antibody under conditions such that specific binding can occur, and detecting or 
measuring the amount of any specific binding by the antibody. Such binding of 
5 antibody, in tissue sections, can be used to detect aberrant MTSP20 polypeptide 
localization or aberrant {e.g., increased, decreased or absent) levels of MTSP20 
polypeptide. In a specific embodiment, antibody to an MTSP20 polypeptide can 
be used to assay in a patient tissue or body fluid, such as serum, sample for the 
presence of MTSP20 polypeptide where an aberrant level of MTSP20 

10 polypeptide is an indication of a diseased condition. 

The immunoassays which can be used include but are not limited to 
competitive and non-competitive assay systems using techniques such as 
western blots, radioimmunoassays, ELISA (enzyme linked immunosorbent 
assay), "sandwich" immunoassays, immunoprecipitation assays, precipitin 

1 5 reactions, gel diffusion precipitin reactions, immunodiffusion assays, 

agglutination assays, complement-fixation assays, immunoradiometric assays, 
fluorescent immunoassays and protein A immunoassays. 

MTSP20 polypeptide genes and related nucleic acid sequences and 
subsequences, including complementary sequences, also can be used in 

20 hybridization assays. MTSP20 polypeptide nucleic acid sequences, or 

subsequences thereof containing about at least 8 nucleotides, generally 14 or 16 
or 30 or more, generally less than 1000 or up to 100, contiguous nucleotides 
can be used as hybridization probes. Hybridization assays can be used to 
detect, prognose, diagnose, or monitor conditions, disorders, or disease states 

25 associated with aberrant changes in MTSP20 polypeptide expression and/or 
activity as described herein. In particular, such a hybridization assay is carried 
out by a method of contacting a sample containing nucleic acid with a nucleic 
acid probe capable of hybridizing to MTSP20 polypeptide encoding DNA or RNA, 
under conditions such that hybridization can occur, and detecting or measuring 

30 any resulting hybridization. 

In a specific embodiment, a method of diagnosing a disease or disorder 
characterized by detecting an aberrant level of an MTSP20 polypeptide in a 
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subject is provided herein by measuring the level of the DNA, RNA, protein or 
functional activity of the MTSP20 polypeptide in a sample derived from the 
subject, wherein an increase or decrease in the level of the DNA, RNA, protein or 
functional activity of the MTSP20 polypeptide, relative to the level of the DNA, 
5 RNA, protein or functional activity found in an analogous sample not having the 
disease or disorder indicates the presence of the disease or disorder in the 
subject. 

Kits for diagnostic use are also provided, that contain in one or more 
containers an anti-MTSP20 polypeptide antibody, and, optionally, a labeled 

10 binding partner to the antibody. Alternatively, the anti-MTSP20 polypeptide 
antibody can be labeled (with a detectable marker, e.g., a chemiluminescent, 
enzymatic, fluorescent, or radioactive moiety). A kit is also provided that 
includes in one or more containers a nucleic acid probe capable of hybridizing to 
the MTSP20 polypeptide-encoding nucleic acid. In a specific embodiment, a kit 

15 can include in one or more containers a pair of primers {e.g., each in the size 
range of 6-30 nucleotides) that are capable of priming amplification, e.g., by 
polymerase chain reaction (see e.g., Innis et al., 1990, PCR Protocols, Academic 
Press, Inc., San Diego, CA), ligase chain reaction (see EP 320,308), use of 
reptlcase, cyclic probe reaction, or other methods known in the art) under 

20 appropriate reaction conditions of at least a portion of an MTSP20 polypeptide- 
encoding nucleic acid. A kit can optionally further include in a container a 
predetermined amount of a purified MTSP20 polypeptide or nucleic acid, e.g., for 
use as a standard or control. 

K. Pharmaceutical compositions and modes of administration 

25 1 . Components of the compositions 

Pharmaceutical compositions containing the identified compounds that 
modulate an activity of an MTSP20 polypeptide are provided herein. Also 
provided are combinations of a compound that modulates an activity of an 
MTSP20 polypeptide and another treatment or compound for treatment of a 

30 neoplastic disorder, such as a chemotherapeutic compound. 
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The MTSP20 polypeptide modulator and the anti-tumor agent can be 
packaged as separate compositions for administration together or sequentially or 
intermittently. Alternatively, they can be provided as 

a single composition for administration or as two compositions for administration 
5 as a single composition. The combinations can be packaged as kits. 

a. MTSP20 polypeptide inhibitors 

Any MTSP20 polypeptide inhibitors, including those described herein 
when used alone or in combination with other compounds, that can alleviate, 
reduce, ameliorate, prevent, or place or maintain in a state of remission of 
10 clinical symptoms or diagnostic markers associated with neoplastic diseases, 

including undeslred and/or uncontrolled angiogenesis, can be used in the present 
combinations. 

In one embodiment, the MTSP20 polypeptide inhibitor is an antibody or 
fragment thereof that specifically reacts with an MTSP20 polypeptide or a 
1 5 protease domain thereof, an inhibitor of the MTSP20 polypeptide production, an 
inhibitor of MTSP20 polypeptide membrane-localization, or any inhibitor of the 
expression of or, especially, an activity of an MTSP20 polypeptide. 

b. Anti-angtogenic agents and anti-tumor agents 

Any anti-angiogenic agents and anti-tumor agents, including those 
20, described herein, when used alone or in combination with other compounds, that 
can alleviate, reduce, ameliorate, prevent, or place or maintain in a state of 
remission of clinical symptoms or diagnostic markers associated with undesired 
and/or uncontrolled angiogenesis and/or tumor growth and metastasis, 
particularly solid neoplasms, vascular malformations and cardiovascular 
25 disorders, chronic inflammatory diseases and aberrant wound repairs, circulatory 
disorders, crest syndromes, dermatological disorders, or ocular disorders, can be 
used in the combinations. Also contemplated are anti-tumor agents for use in 
combination with an inhibitor of an MTSP20 polypeptide. 
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c. Pro-angiogenic agent 
Any pro-angiogenic agents, including those described herein, when used 
alone or in combination with other compounds, that can promote physiological 
angiogenesis, particularly angiogenests involved in normal placental, embryonic, 
5 fetal and post-natal development and growth, physiologically cyclical 
development in the ovarian follicle, corpus luteum and post-menstrual 
endometrium or wound healing, can be used in the present combinations. 

The pro-angiogenic agent used in the combination can be a pro- 
angiogenic cytokine (Desai and Libutti, J. Immunother., 22f3) :186-21 1 (1999)). 
1 0 For ecample, the pro-angiogenic cytokine used can be basic fibroblast growth 
factor such as bFGF and FGF-2, a vascular endothelial growth factor/vascular 
permeability factor such as VEGFA/PF and vasculotropin, a platelet-derived 
endothelial cell growth factor such as PD-EDGF and thymidine phosphorylase, a 
transforming growth factor-beta (TGF-^), or angiopoietin-1 {Ang-1). 
15 d. Anti-tumor agents and anti-angiogenic agents 

The compounds identified by the methods provided herein or provided 
herein can be used in combination with anti-tumor agents and/or anti- 
angiogenesis agents and/or pro-angiogenic agents. 

2. Formulations and route of administration 
20 The compounds herein and agents can be formulated as pharmaceutical 

compositions, typically for single dosage administration. The concentrations of 
the compounds in the formulations are effective for delivery of an amount, upon 
administration, that is effective for the intended treatment. Typically, the 
compositions are formulated for single dosage administration. To formulate a 
25 composition, the weight fraction of a compound or mixture thereof is dissolved, 
suspended, dispersed or otherwise mixed in a selected vehicle at an effective 
concentration such that the treated condition is relieved or ameliorated. 
Pharmaceutical carriers or vehicles suitable for administration of the compounds 
provided herein include any such carriers known to those skilled in the art to be 
30 suitable for the particular mode of administration. 

In addition, the compounds can be formulated as the sole 
pharmaceutically active ingredient in the composition or can be combined with 
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other active ingredients. Liposomal suspensions, including tissue-targeted 
liposomes, can also be suitable as pharmaceutically acceptable carriers. These 
can be prepared according to methods known to those skilled In the art. For 
example, liposome formulations can be prepared as described in U.S. Patent No. 
5 4,522,811. 

The active compound is included in the pharmaceutically acceptable 
carrier in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. The therapeutically 
effective concentration can be determined empirically by testing the compounds 

10 In known in vitro and in vivo systems, such as the assays provided herein. 

The concentration of active compound in the drug composition depends 
on absorption, inactivation and excretion rates of the active compound, the 
physicochemical characteristics of the compound, the dosage schedule, and 
amount administered as well as other factors known to those of skill in the art. 

15 Typically a therapeutically effective dosage is contemplated. The 

amounts administered can be on the order of 0.001 to 1 mg/ml, including about 
0.005-0.05 mg/ml and about 0.01 mg/ml, of blood volume. Pharmaceutical 
dosage unit forms are prepared to provide from about 1 mg to about 1000 mg, 
including from about 10 to about 500 mg, and including about 25-75 mg of the 

20 essential active ingredient or a combination of essential ingredients per dosage 
unit form. The precise dosage can be empirically determined. 

The active ingredient can be administered at once, or can be divided into 
a number of smaller doses to be administered at intervals of time. It is 
understood that the precise dosage and duration of treatment is a function of the 

25 disease being treated and can be determined empirically using known testing 
protocols or by extrapolation from in vivo or in vitro test data. It is to be noted 
that concentrations and dosage values can also vary with the severity of the 
condition to be alleviated. It is to be further understood that for any particular 
subject, specific dosage regimens should be adjusted over time according to the 

30 individual need and the professional judgment of the person administering or 
supervising the administration of the compositions, and that the concentration 
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ranges set forth herein are exemplary only and are not intended to limit the 
scope or use of the claimed compositions and combinations containing them. 

Pharmaceutically acceptable derivatives include acids, salts, esters, 
hydrates, solvates and prodrug forms. The derivative is typically selected such 
5 that its pharmacokinetic properties are superior to thB corresponding neutral 
compound. 

Thus, effective concentrations or amounts of one or more of the 
compounds provided herein or pharmaceutically acceptable derivatives thereof 
are mixed with a suitable pharmaceutical carrier or vehicle for systemic, topical 

10 or local administration to form pharmaceutical compositions. Compounds are 
included in an amount effective for ameliorating or treating the disorder for 
which treatment Is contemplated. The concentration of active compound in the 
composition depends on absorption, inacttvation, excretion rates of the active 
compound, the dosage schedule, amount administered, particular formulation as 

1 5 well as other factors known to those of skill in the art. 

Solutions or suspensions used for parenteral, intradermal, subcutaneous, 
or topical application can include any of the following components: a sterile 
diluent, such as water for injection, saline solution, fixed oil, polyethylene glycol, 
glycerine, propylene glycol or other synthetic solvent; antimicrobial agents, such 

20 as benzyl alcohol and methyl parabens; antioxidants, such as ascorbic acid and 
sodium bisulfite; chelating agents, such as ethylenediaminetetraacetic acid 
(EDTA); buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 
preparations can be enclosed in ampules, disposable syringes or single or 

25 multiple dose vials made of glass, plastic or other suitable material. 

In instances In which the compounds exhibit insufficient solubility, 
methods for solubilizing compounds can be used. Such methods are known to 
those of skill in this art, and include, but are not limited to, using cosolvents, 
such as dimethylsulfoxide (DIVISG), using surfactants, such as Tween®, or 

30 dissolution in aqueous sodium bicarbonate. Derivatives of the compounds, such 
as prodrugs of the compounds can also be used in formulating effective 
pharmaceutical compositions. For ophthalmic indications, the compositions are 
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formulated in an ophthalmically acceptable carrier. For the ophthalmic uses 
herein, local administration, either by topical administration or by injection are 
contemplated. Time release formulations are also desirable. Typically, the 
compositions are formulated for single dosage administration, so that a single 
5 dose administers an effective amount. 

Upon mixing or addition of the compound with the vehicle, the resulting 
mixture can be a solution, suspension, emulsion or other composition. The form 
of the resulting mixture depends upon a number of factors, including the 
intended mode of administration and the solubility of the compound in the 

10 selected carrier or vehicle. If necessary, pharmaceutically acceptable salts or 
other derivatives of the compounds are prepared. 

The compound is included in the pharmaceutically acceptable carrier in an 
amount sufficient to exert a therapeutically useful effect in the absence of 
undesirable side effects on the patient treated. It is understood that number and 

1 5 degree of side effects depends upon the condition for which the compounds are 
administered. For example, certain toxic and undesirable side effects are 
tolerated when treating life-threatening illnesses that would not be tolerated 
when treating disorders of lesser consequence. 

The compounds also can be mixed with other active materials, that do 

20 not impair the desired action, or with materials that supplement the desired 
action known to those of skill in the art. The formulations of the compounds 
and agents for use herein include those suitable for oral, rectal, topical, 
inhalational, buccal {e.g., sublingual), parenteralje.^., subcutaneous, 
intramuscular, intradermal, or intravenous), transdermal administration or any 

25 route. The most suitable route in any given case depends on the nature and 

severity of the condition being treated and on the nature of the particular active 
compound which is being used. The formulations are provided for administration 
to humans and animals in unit dosage forms, such as tablets, capsules, pills, 
powders, granules, sterile parenteral solutions or suspensions, and oral solutions 

30 or suspensions, and oil-water emulsions containing suitable quantities of the 
compounds or pharmaceutically acceptable derivatives thereof. The 
pharmaceutically therapeutically active compounds and derivatives thereof are 
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typicatly formulated and administered in unit-dosage forms or multiple-dosage 
forms. Unit-dose forms as used herein refers to physically discrete units suitable 
for human and animal subjects and packaged individually as is known in the art. 
Each unit-dose contains a predetermined quantity of the therapeutically active 
5 compound sufficient to produce the desired therapeutic effect, in association 
with the required pharmaceutical carrier, vehicle or diluent. Examples of 
un'rt-dose forms include ampoules and syringes and individually packaged tablets 
or capsules. Unit-dose forms can be administered in fractions or multiples 
thereof. A multiple-dose form is a plurality of identical unit-dosage forms 

10 packaged in a single container to be administered in segregated unit-dose form. 
Examples of multiple-dose forms include vials, bottles of tablets or capsules or 
bottles of pints or gallons. Hence, multiple dose form is a multiple of unit-doses 
which are not segregated in packaging. 

The composition can contain along with the active Ingredient: a diluent 

1 5 such as lactose, sucrose, dicalcium phosphate, or carboxymethylcellulose; a 
lubricant, such as magnesium stearate, calcium stearate and talc; and a binder 
such as starch, natural gums, such as gum acacia, gelatin, glucose, molasses, 
polvinylpyrrolidine, celluloses and derivatives thereof, povidone, crospovidones 
and other such binders known to those of skill in the art. Liquid 

20 pharmaceutically administrable compositions can, for example, be prepared by 
dissolving, dispersing, or otherwise mixing an active compound as defined above 
and optional pharmaceutical adjuvants in a carrier, such as, for example, water, 
saline, aqueous dextrose, glycerol, glycols, ethanol, and the like, to thereby form 
a solution or suspension. If desired, the pharmaceutical composition to be 

25 administered can also contain minor amounts of nontoxic auxiliary substances 
such as wetting agents, emulsifying agents, or solubilizing agents, pH buffering 
agents and the like, for example, acetate, sodium citrate, cyclodextrine 
derivatives, sorbitan monolaurate, triethanolamine sodium acetate, 
triethanolamine oleate, and other such agents. Methods of preparing such 

30 dosage forms are known, or will be apparent, to those skilled in this art (see, 
e.g.. Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, 
Pa., 15th Edition, 1975). The composition or formulation to be administered 
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contains a quantity of the active compound in an amount sufficient to alleviate 
the symptoms of the treated subject. 

Dosage forms or compositions containing active ingredient in the range of 
0.005% to 100% with the balance made up from non-toxic carrier can be 
5 prepared. For oral administration, the pharmaceutical compositions can take the 
form of, for example, tablets or capsules prepared by conventional means with 
pharmaceutlcally acceptable excipients such as binding agents {e.g., 
pregelatinized maize starch, polyvinyl pyrrolidone or hydroxypropyl 
methylcellulose); fillers {e.g., lactose, microcrystalline cellulose or calcium 
10 hydrogen phosphate); lubricants {e.g., magnesium stearate, talc or silica); 

disintegrants {e.g., potato starch or sodium starch glycolate); or wetting agents 
{e.g., sodium lauryl sulfate). The tablets can be coated by methods well-known 
in the art. 

The pharmaceutical preparation can also be in liquid form, for example, 
1 5 solutions, syrups or suspensions, or can be presented as a drug product for 
reconstitution with water or other suitable vehicle before use. Such liquid 
preparations can be prepared by conventional means with pharmaceutically 
acceptable additives such as suspending agents (e.flr., sorbitol syrup, cellulose 
derivatives or hydrogenated edible fats); emulsifying agents {e.g., lecithin or 
20 acacia); non-aqueous vehicles {e.g., almond oil, oily esters, or fractionated 

vegetable oils); and preservatives {e.g., methyl or propyl-p-hydroxybenzoates or 
sorbic acid). 

Formulations suitable for rectal administration can be presented as unit 
dose suppositories. These can be prepared by admixing the active compound 

25 with one or more conventional solid carriers, for example, cocoa butter, and then 
shaping the resulting mixture. 

Formulations suitable for topical application to the skin or to the eye 
generally are formulated as an ointment, cream, lotion, paste, gel, spray, aerosol 
and oil. Carriers which can be used include vaselineT lanoline, polyethylene 

30 glycols, alcohols, and combinations of two or more thereof. The topical 

formulations can further advantageously contain 0.05 to 1 5 percent by weight 
of thickeners selected from among hydroxypropyl methyl cellulose, methyl 
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cellulose, polyvinylpyrrolidone, polyvinyl alcohol, poly (alkylene glycols), 
poly/hydroxyalkyl, (meth)acrylates or poly(meTh)acrylamides. A topical 
formulation is often applied by instillation or as an ointment into the conjunctival 
sac. It also can be used for irrigation or lubrication of the eye. facial sinuses, 
5 and external auditory meatus. It can also be injected into the anterior eye 
chamber and other places. The topical formulations in the liquid state can be 
also present in a hydrophilic three-dimensional polymer matrix in the form of a 
strip, contact lens, and the like from which the active components are released. 
For administration by inhalation, the compounds for use herein can be 

10 delivered in the form of an aerosol spray presentation from pressurized packs or 
a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorof luoromethane, dichlorotetrafluoroethane, carbon dioxide or other 
suitable gas. In the case of a pressurized aerosol, the dosage unit can be 
determined by providing a valve to deliver a metered amount. Capsules and 

15 cartridges of, e,g., gelatin, for use in an inhaler or Insufflator can be formulated 
containing a powder mix of the compound and a suitable powder base such as 
lactose or starch. 

Formulations suitable for buccal (sublingual) administration include, for 
example, lozenges containing the active compound in a flavored base, usually 

20 sucrose and acacia or tragacanth; and pastilles containing the compound in an 
inert base such as gelatin and glycerin or sucrose and acacia. 

The compounds can be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection can be presented In unit dosage form, e.g.. in ampules or in multi-dose 

25 containers, wrth an added preservative. The compositions can be suspensions, 
solutions or emulsions in oily or aqueous vehicles, and can contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active Ingredient can be In powder form for reconstitution with a suitable 
vehicle, e.g., sterile pyrogen-free water or other solvents, before use. 

30 Formulations suitable for transdermal administration can be presented as 

discrete patches adapted to remain in intimate contact with the epidermis of the 
recipient for a prolonged period of time. Such patches suitably contain the 
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active compound as an optionally buffered aqueous solution of, for example, 0.1 
to 0.2 M concentration with respect to the active compound. Formulations 
suitable for transdermal administration can also be delivered by iontophoresis 
[see, e.g.. Pharmaceutical Research 3 (6), 318 (1986)) and typically take the 
5 form of an optionally buffered aqueous solution of the active compound. 

The pharmaceutical compositions can also be administered by controlled 
release means and/or delivery devices (see, e.g., in U.S. Patent Nos. 3,536,809; 
3,598,123; 3,630,200; 3,845,770; 3,847,770; 3,916,899; 4,008,719; 
4,687,610; 4,769,027; 5,059,595; 5,073,543; 5,120,548; 5,354,566; 
10 5,591,767; 5,639,476; 5,674,533 and 5,733,566). 

Desirable blood levels can be maintained by a continuous infusion of the 
active agent as ascertained by plasma levels. It should be noted that the 
attending physician would know how to and when to terminate, interrupt or 
adjust therapy to lower dosage due to toxicity, or bone marrow, liver or kidney 
1 5 dysfunctions. Conversely, the attending physician would also know how to and 
when to adjust treatment to higher levels if the clinical response is not adequate 
(precluding toxic side effects). 

The efficacy and/or toxicity of the MTSP20 polypeptide inhibitorls), alone 
or in combination with other agents also can be assessed by the methods known 
20 in the art (see generally, O'Reilly, Investigational New Drugs, /5:5-13 (1997)). 

The active compounds or pharmaceutically acceptable derivatives can be 
prepared with carriers that protect the compound against rapid elimination from 
the body, such as time release formulations or coatings. 

Kits containing the compositions and/or the combinations with 
25 instructions for administration thereof are provided. The kit can further include a 
needle or syringe, typically packaged in sterile form, for injecting the complex, 
and/or a packaged alcohol pad. Instructions are optionally included for 
administration of the active agent by a clinician or by the patient. 

Finally, the compounds or MTSP20 polypeptides or protease domains 
30 thereof or compositions containing any of the preceding agents can be packaged 
as articles of manufacture containing packaging material, a compound or suitable 
derivative thereof provided herein, which is effective for treatment of diseases or 
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disorders contemplated herein, within the packaging nnaterial, and a label that 
indicates that the compound or a suitable derivative thereof is for treating the 
diseases or disorders contemplated herein. The label can optionally include the 
disorders for which the therapy is warranted. 
5 L. Methods of treatment 

The compounds identified by the methods herein are used for treating or 
preventing neoplastic diseases in an animal, particularly a mammal. Including a 
human, as provided herein. In one embodiment, the method includes 
administering to a mammal an effective amount of an inhibitor of an MTSP20 

10 polypeptide, whereby the disease or disorder is treated or prevented. 

In an embodiment, the MTSP20 polypeptide inhibitor used in the 
treatment or prevention is administered with a pharmaceutically acceptable 
carrier or excipient. The mammal treated can be a human. The inhibitors 
provided herein are those identified by the screening assays. In addition, 

1 5 antibodies and antisense nucleic acids or double-stranded RNA (dsRNA). such as 
RNAi, are contemplated. 

The treatment or prevention method can further include administering an 
anti-angiogenic treatment or agent or anti-tumor agent simultaneously with, prior 
to or subsequent to the MTSP20 polypeptide inhibitor, which can be any 

20 compound identified that inhibits an activity of an MTSP20 polypeptide. Such 
compounds include small molecule modulators, an antibody or a fragment or 
derivative thereof containing a binding region thereof against the MTSP20 
polypeptide, an antisense nucleic acid or double-stranded RNA (dsRNA), such as 
RNAi, encoding an a portion of the MTSP20 polypeptide or complementary to 

25 thereto, and a nucleic acid containing at least a portion of a gene encoding the 
MTSP20 polypeptide into which a heterologous nucleotide sequence has been 
inserted such that the heterologous sequence inactivates the biological activity 
of at least a portion of the gene encoding the MTSP20 polypeptide, in which the 
portion of the gene encoding the MTSP20 polypeptide flanks the heterologous 

30 sequence to promote homologous recombination with a genomic gene encoding 
the MTSP20 polypeptide. In addition, such molecules are generally less than 
about 1000 nt long. 
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1 . Antisense treatment 

In a specific embodiment, as described hereinabove, MTSP20 polypeptide 
function is reduced or inhibited by MTSP20 polypeptide antisense nucleic acids, 
to treat or prevent neoplastic disease. The therapeutic or prophylactic use of 
5 nucleic acids of at least six nucleotides, generally up to about 1 50 nucleotides, 
that are antisense to a gene or cDNA encoding MTSP20 polypeptide or a portion 
thereof is provided. An MTSP20 polypeptide "antisense" nucleic acid as used 
herein refers to a nucleic acid capable of hybridizing to a portion of an MTSP20 
polypeptide RNA {generally mRNA) by virtue of some sequence complementarity, 

10 and generally under high stringency conditions. The antisense nucleic acid can 
be complementary to a coding and/or noncoding region of an MTSP20 
polypeptide mRNA. Such antisense nucleic acids have utility as therapeutics 
that reduce or inhibit MTSP20 polypeptide function, and can be used in the 
treatment or prevention of disorders as described supra, 

1 5 The MTSP20 polypeptide antisense nucleic acids are of at least six 

nucleotides and are generally oligonucleotides (ranging from 6 to about 1 50 
nucleotides including 6 to 50 nucleotides). The antisense molecule can be 
complementary to all or a portion of a protease domain. For example, the 
oligonucleotide is at least 1 0 nucleotides, at least 1 5 nucleotides, at least 1 00 

20 nucleotides, or at least 125 nucleotides. The oligonucleotides can be DIMA or 
RNA or chimeric mixtures or derivatives or modified versions thereof, single- 
stranded or double-stranded. The oligonucleotide can be modified at the base 
moiety, sugar moiety, or phosphate backbone. The oligonucleotide can include 
other appending groups such as peptides, or agents facilitating transport across 

25 the cell membrane (see, e.g., Letsinger et al., Proc. Natl. Acad. Set. U.S.A. 

55:6553-6556 (1989); Lemaitre et at., Proc. NatL Acad. Sci U.S.A. 54:648-652 
(1987); PCT Publication No. WO 88/09810, published December 15, 1988) or 
blood-brain barrier (see, e.g.. PCT Publication No. WO 89/10134, published April 
25. 1988), hybridization-triggered cleavage agents (see, e.g., Krol et al., 

30 BioTechniques 5:958-976 (1988)) or intercalating agents (see, e.g., Zon, Pharm. 
Res. 6:539-549 (1988)). 
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The MTSP20 polypeptide antisense nucleic acid generally is an oligo- 
nucleotide, typically single-stranded DNA or RNA or an analog thereof or 
mixtures thereof. For example, the oligonucleotide includes a sequence 
antisense to a portion of a nucleic acid that encodes a human MTSP20 
5 polypeptide. The oligonucleotide can be modified at any position on its structure 
with substituents generally known in the art. 

The MTSP20 polypeptide antisense oligonucleotide can include at least 
one modified base moiety which is selected from the group including, but not 
limited to 5-fluorouracll, 5-bromouracit, 5-chlorouracil, 5-iodouracil, 
10 hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethyiaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-O-galactosylqueosine, inoslne, N6-isopentenyladenine, 

1- methylguaninc, 1 -methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

2- methyIguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 

1 5 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl- 
2-thiouracil, beta-D-mannosylqueosine, 5 '-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), 
wybutoxosine, pseudouracil. queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 
2-thiouracil, 4-thtouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, 

20 3-(3-amino-3-N-2-carboxypropyll uracil. (acp3)w, and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide includes at least one modified 
sugar moiety selected from the group including but not limited to arabinose, 
2-fluoroarabinose, xylulose, and hexose. The oligonucleotide can include at least 
one modified phosphate backbone selected from a phosphorothioate, a 

25 phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 

phosphordiamidate, a methylphosphonate, an alkyi phosphotriester, and a 
formacetal or analog thereof. 

The oligonucleotide can be an a-anomeric oligonucleotide. An o-anomeric 
oligonucleotide forms specific double-stranded hybrids with complementary RNA 

30 in which the strands run parallel to each other (Gautier et al., Nuct. Acids Res. 
75:6625-6641 (1987)}. 
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The oligonucleotide can be conjugated to another molecule, such as, but 
are not limited to, a peptide, hybridization triggered cross-linking agent, transport 
agent or a hybridization-triggered cleavage agent- The oligonucleotides can be 
synthesized by standard methods known in the art, e.g. by use of an automated 
5 DNA synthesizer (such as are commercially available from Biosearch, Applied 
Biosystems, etc.). As examples, phosphorothioate oligonucleotides can be 
synthesized by the method of Stein et al. {NucL Acids Res. 75:3209 (1988)), 
methylphosphonate oligonucleotides can be prepared by use of controlled pore 
glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A. S5:7448-7451 

10 (1988)), etc. 

In a specific embodiment, the MTSP20 polypeptide antisense 
oligonucleotide includes catalytic RNA or a ribozyme (see, e.g., PCT International 
Publication WO 90/1 1364, published October 4, 1990; Sarver et al., Science 
247:1222-1225 (1990)), In another embodiment, the oligonucleotide is a 2'-0- 

1 5 methylrlbonucleotide (Inoue et a!., Nucl. Acids Res. 75:61 31 -61 48 (1 987)), or a 
chimeric RNA-DNA analogue (Inoue et a!., FEBS Lett. 275:327-330 (1987)). 
Alternatively, the oligonucleotide can be double-stranded RNA (dsRNA) such as 
RNAi. 

In an alternative embodiment, the MTSP20 polypeptide antisense nucleic 
20 acid is produced intracetlularly by transcription from an exogenous sequence. 

For example, a vector can be introduced in vivo such that it is taken up by a cell, 
within which cell the vector or a portion thereof is transcribed, producing an 
antisense nucleic acid (RNA), Such a vector would contain a sequence encoding 
the IV1TSP20 polypeptide antisense nucleic acid. Such a vector can remain 
25 episomal or become chromosomally integrated, as long as it can be transcribed 
to produce the desired antisense RNA. Such vectors can be constructed by 
recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in 
mammalian cells. Expression of the sequence encoding the MTSP20 polypeptide 
30 antisense RNA can be by any promoter known in the art to act in mammalian, 
including human, cells. Such promoters can be inducible or constitutive. Such 
promoters include but are not limited to: the SV40 early promoter region 
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(Bernoist and Chambon, Nature 230:304-310 (1981), the promoter contained in 
the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., Ce// 22:787- 
797 (1980), the herpes thymidine kinase promoter (Wagner et al., Proc. Nat/. 
Acad. Sci. U.S.A. 75:1441-1445 (1981), the regulatory sequences of the 
5 metallothionein gene (Brinster et al.. Nature 255:39^2 (1982), etc. 

The antisense nucleic acids include sequence complementary to at least a 
portion of an RNA transcript of an MTSP20 polypeptide gene, including a human 
MTSP20 polypeptide gene. Absolute complementarity is not required. 

The amount of MTSP20 polypeptide antisense nucleic acid that is 

1 0 effective in the treatment or prevention of neoplastic disease depends on the 
nature of the disease, and can be determined empirically by standard clinical 
techniques. Where possible, it is desirable to determine the antisense 
cytotoxicity in cells tn vitro, and then in useful animal model systems prior to 
testing and use in humans. 

15 2. RNA interference 

RNA interference (RNAi) (see, e.g. Chuang et al. (2000) Proc. Natl. Acad. 
Sci. U.S.A. 57:4985) can be employed to inhibit the expression of a gene 
encoding an MTSP20. Interfering RNA (RNAi) fragments, particularly double- 
stranded (ds) RNAi, can be used to generate loss-of-MTSP20 function. Methods 

20 relating to the use of RNAi to silence genes in organisms including, mammals, C. 
elegans, Drosophila and plants, and humans are known (see, e.g., Fire et at. 
(1998) Nature 3S/:806-81 1 Fire (1999) Trends Genet. r5:358-363; Sharp 
(2001) Genes Dev. 75:485-490; Hammond, et at. (2001) Nature Rev. 
Genet,2'^^0'^ 119; TuschI (2001) Chem, Biochem. 2:239-245; Hamilton et at. 

25 (1999) Science 256:950-952; Hammond et al. (2000) Nature 404:293-296; 
Zamore et al. (2000) Cell ;07:25-33; Bernstein et al. (2001) Nature 409: 363- 
366; Elbashir et al. (2001) Genes Dev. /5:188-200; Elbashir et al. (2001) Nature 
47 7:494-498; International PCT application No. WO 01/29058; International 
PCT application No. WO 99/32619). 

30 Double-stranded RNA (dsRNA)-expressing constructs are introduced into 

a host, such as an animal or plant using, a replicable vector that remains 
episomal or integrates into the genome. By selecting appropriate sequences. 
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expression of dsRNA can interfere with accumulation of endogenous mRNA 
encoding an MTSP20. RNAi also can be used to inhibit expression in vitro. 
Regions including at least about 21 (or 21) nucleotides that are selective (i.e. 
unique) for MTSP20 are used to prepare the RNAi. Smaller fragments of about 
5 21 nucleotides can be transformed directly {i.e.. in vitro or in vivo) into cells; 
larger RNAi dsRNA molecules are generally introduced using vectors that encode 
them. dsRNA molecules are at least about 21 bp long or longer, such as 50, 
100, 150, 200 and longer. Methods, reagents and protocols for introducing 
nucleic acid molecules into cells in vitro and in vivo are known to those of skill in 
10 the art. 

3. Gene Therapy 

In an exemplary embodiment, nucleic acids that include a sequence of 
nucleotides encoding an MTSP20 polypeptide or functional domains or derivative 
thereof, are administered to promote MTSP20 polypeptide function, by way of 

1 5 gene therapy. Gene therapy refers to therapy performed by the administration of 
a nucleic acid to a subject. In this embodiment, the nucleic acid produces its 
encoded protein that mediates a therapeutic effect by promoting MTSP20 
polypeptide function. Any of the methods for gene therapy available in the art 
can be used (see, Goldspiel et ai., Ciinicai Pharmacy r2:488-505 (1993); Wu 

20 and Wu, Btotherapy 3:87-95 (1991); Tolstoshev, An. Rev. Pharmacol. Toxicol. 
32:573-596 (1993); Mulligan, Science 250:926-932 (1993); and Morgan and 
Anderson, An. Rev. Biochem. 62:191-217 (1993); TIBTBCH 7 /r5>l: 1 55-21 5 
(1993). 

For example, one therapeutic composition for gene therapy includes an 
25 MTSP20 polypeptide-encoding nucleic acid that is part of an expression vector 
that expresses an MTSP20 polypeptide or domain, fragment or chimeric protein 
thereof in a suitable host. In particular, such a nucleic acid has a promoter 
operably linked to the MTSP20 polypeptide coding region, the promoter being 
inducible or constitutive, and, optionally, tissue-specific. In another particular 
30 embodiment, a nucleic acid molecule is used in which the MTSP20 polypeptide 
coding sequences and any other desired sequences are flanked by regions that 
promote homologous recombination at a desired site in the genome, thus 
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providing for intrachromosomal expression of the SP protein nucleic acid (Koller 
and Smithies, Proc. Natl, Acad. Sci. USA 55:8932-8935 (1989); Zijistra et al.. 
Nature 342:435-438 (1 989)). 

Delivery of the nucleic acid into a patient can be either direct, in which 
5 case the patient is directly exposed to the nucleic acid or nucleic acid-carrying 
vector, or indirect. In which case, cells are first transformed with the nucleic acid 
in vitro, then transplanted into the patient. These two approaches are known, 
respectively, as in vivo or ex vivo gene therapy. 

in a specific embodiment, the nucleic acid is directly administered in vivo. 

1 0 where it is expressed to produce the encoded product. This can be 
accomplished by any of numerous methods known in the art, e.g., by 
constructing it as part of an appropriate nucleic acid expression vector and 
administering it so that it becomes intracellular, e.g., by infection using a 
defective or attenuated retroviral or other viral vector (see U.S. Patent No. 

15 4,980,286), or by direct injection of naked DNA, or by use of microparticle 

bombardment (e.^., a gene gun; Biolistic, Dupont], or coating with lipids or cell- 
surface receptors or transfecting agents, encapsulation in liposomes, 
microparticles, or microcapsules, or by administering it in linkage to a peptide 
which Is known to enter the nucleus, by administering it in linkage to a ligand 

20 subject to receptor-mediated endocytosis (see e.g., Wu and Wu, J. Biol. Chem. 
252:4429-4432 (1987)) (which can be used to target cell types specifically 
expressing the receptors), etc. In another embodiment, a nucleic acid-ligand 
complex can be formed in which the ligand is a fusogenic viral peptide to disrupt 
endosomes, allowing the nucleic acid to avoid lysosomal degradation. In yet 

25 another embodiment, the nucleic acid can be targeted in vivo for cell specific 
uptake and expression, by targeting a specific receptor (see, e.g., PCT 
Publications WO 92/06180 dated April 16, 1992 (Wu et al.); WO 92/22635 
dated December 23, 1992 (Wilson et al.); W092/2031 6 dated November 26, 
1992 (Findeis et al.); W093yi4188 dated July 22, 1993 (Clarke et a!.), WO 

30 93/20221 dated October 14, 1993 (Young)). Alternatively, the nucleic acid can 
be introduced intracellularly and incorporated within host cell DNA for 
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expression, by homologous recombination (Kotler and Smithies, Proc. Natl. Acad. 
Set. USA 86:8932-8935 (1989); Zijistra et al.. Nature 342:435-438 (1989)). 

In a specific embodiment, a viral vector that contains the MTSP20 
polypeptide nucleic acid is used. For example, a retroviral vector can be used 
5 (see Miller et al., Meth. Enzymol. 27 7:581-599 (1993)). These retroviral vectors 
have been modified to delete retroviral sequences that are not necessary for 
packaging of the viral genome and integration into host cell DNA. The MTSP20 
polypeptide nucleic acid to be used in gene therapy is cloned into the vector, 
which facilitates delivery of the gene into a patient. More detail about retroviral 

10 vectors can be found in Boesen et al., Biotherapy ff:291-302 (1994), which 

describes the use of a retroviral vector to deliver the mdrl gene to hematopoietic 
stem cells in order to make the stem cells more resistant to chemotherapy. 
Other references illustrating the use of retroviral vectors in gene therapy are: 
Clowes et al., J. Clin, Invest 53:644-651 (1994); Kiem et al., Bfood 53:1467- 

15 1473 (1994); Salmons and Gunzberg, Human Gene Therapy 4:129-141 (1993); 
and Grossman and Wilson, Curr. Opin. in Genetics and Devei 3:1 10-1 14 
(1993). 

Adenoviruses are other viral vectors that can be used in gene therapy. 
Adenoviruses are especially attractive vehicles for delivering genes to respiratory 

20 epithelia. Adenoviruses naturally infect respiratory epithelia where they cause a 
mild disease. Other targets for adenovirus-based delivery systems are liver, the 
central nervous system, endothelial cells, and muscle. Adenoviruses have the 
advantage of being capable of infecting non-dividing cells. Kozarsky and Wilson, 
Current Opinion in Genetics and Development 3:499-503 (1993) present a 

25 review of adenovirus-based gene therapy. Bout et al., Human Gene Therapy 
5:3-10 (1994) demonstrated the use of adenovirus vectors to transfer genes to 
the respiratory epithelia of rhesus monkeys. Other instances of the use of 
adenoviruses in gene therapy can be found in Rosenfeld et al.. Science 252'A2\- 
434 (1991); Rosenfeld et al., Ce// 65:143-1 55 (1992); and Mastrangeli et aL, J. 

30 din. invest. 5/:225-234 (1993). 

Adeno-associated virus (AAV) has also been proposed for use in gene 
therapy (Walsh et af. (1993) Proc. Soc. Exp. Biol. Med. 204:289-300). 
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Another approach to gene therapy involves transferring a gene to cells in 
tissue culture by such methods as electroporation, lipofectlon, calciunfi 
phosphate mediated transfection, or viral Infection. Usually, the method of 
transfer includes the transfer of a selectable marker to the cells. The cells are 
5 then placed under selection to isolate those cells that have taken up and are 
expressing the transferred gene. Those cells are then delivered to a patient. 

In this embodiment, the nucleic acid is Introduced into a cell prior to 
administration in vivo of the resulting recombinant cell. Such introduction can 
be carried out by any method known in the art, including but not limited to 

10 transfection, electroporation, microinjection, infection with a viral or 
bacteriophage vector containing the nucleic acid sequences, cell fusion, 
chromosome-mediated gene transfer, microcell-mediated gene transfer, 
spheroplast fusion, etc. Numerous techniques are known in the art for the 
introduction of foreign genes Into cells (see e.g., Loeffler and Behr, Meth. 

15 BnzymoL 2/ 7:599-618 (1993); Cohen et al., Meth, Enzymoi 27 7:618-644 

(1993); Cline, Pharmac. Then 25:69-92 (1985)) and can be used, provided that 
the necessary developmental and physiological functions of the recipient cells 
are not disrupted. The technique should provide for the stable transfer of the 
nucleic acid to the cell, so that the nucleic acid is expressible by the cell and 

20 generally heritable and expressible by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by various 
methods known in the art. In an embodiment, epithelial cells are injected, e.g., 
subcutaneously. In another embodiment, recombinant skin cells can be applied 
as a skin graft onto the patient. Recombinant blood cells [e.g., hematopoietic 

25 stem or progenitor cells) can be administered intravenously. The amount of cells 
envisioned for use depends on the desired effect, patient state, etc., and can be 
determined by one skilled in the art. 

Cells into which a nucleic acid can be introduced for purposes of gene 
therapy encompass any desired, available cell type, and include but are not 

30 limited to epithelial cells, endothelial cells, keratinocytes, fibroblasts, muscle 
cells, hepatocytes; blood cells such as T lymphocytes, B lymphocytes, 
monocytes, macrophages, neutrophils, eosinophils, megakaryocytes. 
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granulocytes; various stem or progenitor cells, in particular hematopoietic stem 
or progenitor cells, e.g., such as stem cells obtained from bone marrow, 
umbilical cord blood, peripheral blood, fetal liver, and other sources thereof. 

For example, a celt used for gene therapy is autologous to the patient. In 
5 an embodiment in which recombinant cells are used in gene therapy, an MTSP20 
polypeptide nucleic acid is introduced into the celts such that it is expressible by 
the cells or their progeny, and the recombinant cells are then administered in 
vivo for therapeutic effect. In a specific embodiment, stem or progenitor cells 
are used. Any stem and/or progenitor cells which can be isolated and 

10 maintained in vitro can potentially be used In accordance with this embodiment. 
Such stem celts include but are not limited to hematopoietic stem cells (HSC), 
stem cells of epithelial tissues such as the skin and the lining of the gut, 
embryonic heart muscle cells, liver stem cells (PCT Publication WO 94/08598, 
dated April 28, 1994), and neural stem cells (Stemple and Anderson, Ceil 

15 77:973-985 (1992)}. 

Epithelial stem cells (ESCs) or keratinocytes can be obtained from tissues 
such as the skin and the lining of the gut by known procedures (Rhelnwald, 
Meth. Cell Bio. 27A:229 (1980)). In stratified epithelial tissue such as the skin, 
renewal occurs by mitosis of stem cells within the germinal layer, the layer 

20 closest to the basal lamina. Stem cells within the lining of the gut provide for a 
rapid renewal rate of this tissue. ESCs or keratinocytes obtained from the skin 
or lining of the gut of a patient or donor can be grown in tissue culture 
(Rheinwald, /Wer/?. Cell Bio. 2M:229 (1980); Pittelkow and Scott, Cano Clinic 
Pfoc. 61\11\ (1986)). If the ESCs are provided by a donor, a method for 

25 suppression of host versus graft reactivity {e.g., irradiation, drug or antibody 
administration to promote moderate immunosuppression) also can be used. 

With respect to hematopoietic stem cells (HSC), any technique which 
provides for the isolation, propagation, and maintenance in vitro of HSC can be 
used in this embodiment. Techniques by which this can be accomplished 

30 include (a) the isolation and establishment of HSC cultures from bone marrow 
cells isolated from the future host, or a donor, or (b) the use of previously 
established long-term HSC cultures, which can be allogeneic or xenogeneic. 
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Non-autologous HSC generally are used with a method of suppressing 
transplantation immune reactions of the future host/patient. In a particular 
embodiment, human bone marrow cells can be obtained from the posterior iliac 
crest by needle aspiration {see, e.g., Kodo et at., J. Clin. Invest. 7J: 1377-1 384 
5 (1984)). For example, the HSCs can be made highly enriched or in substantially 
pure form. This enrichment can be accomplished before, during, or after long- 
term culturing, and can be done by any techniques known in the art. Long-term 
cultures of bone marrow cells can be established and maintained by using, for 
example, modified Dexter cell culture techniques (Dexter et al., J. Ce// Physio f. 

10 57:335 (1977) or Wit! ock- Witt e culture techniques (Witlock and Witte, Proc. 
Natl. Acad. Sci. USA 75:3608-3612 (1982)). 

In a specific embodiment, the nucleic acid to be introduced for purposes 
of gene therapy includes an inducible promoter operably linked to the coding 
region, such that expression of the nucleic acid is controllable by controlling the 

15 presence or absence of the appropriate inducer of transcription. 
4. Prodrugs 

A method for treating tumors is provided. The method is practiced by 
administering a prodrug that is cleaved at a specific site by an MTSP20 to 
release an active drug or precursor that can be converted to active drug in vivo. 

20 Upon contact with a cell that expresses MTSP20 activity, the prodrug is 

converted into an active drug. The prodrug can be a conjugate that contains the 
active agent, such as an anti-tumor drug, such as a cytotoxic agent, or other 
therapeutic agent (TA), linked to a substrate for the targeted MTSP20, such that 
the drug or agent is inactive or unable to enter a cell, in the conjugate, but is 

25 activated upon cleavage. The prodrug, for example, can contain an oligopeptide, 
typically a relatively short, less than about 10 amino acids peptide, that is 
proteolytically cleaved by the targeted MTSP20. Cytotoxic agents, include, but 
are not limited to, alkylating agents, antiproliferative agents and tubulin binding 
agents. Others cytotoxic acids include, but are not limited to, vinca drugs, 

30 mitomycins, bleomycins and taxanes. 
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5. Treatment of undesired angiogenesis 

Compounds and combinations provided herein can be used for treating or 
preventing a disease or disorder^ including cancers and ophthalmic disorders and 
restenosis, associated with undesired and/or uncontrolled angiogenesis in a 
5 mammal. Methods using such compounds and combinations are provided. In 
one embodiment, the method includes administering to a mamma! an effective 
amount of an inhibitor of the MTSP20, which is an endotheliase, whereby the 
disease or disorder is treated or prevented. In another embodiment, the 
endotheliase inhibitor used in the treatment or prevention is administered with a 
10 pharmaceutically acceptable carrier or excipient. The mammal treated can be a 
human. 

In other embodiments, the treatment or prevention method further 
includes administering an anti-angiogenic treatment or agent simultaneously 
with, prior to or subsequent to the MTSP20 inhibitor, which can be any 

1 5 compound identified that inhibits an activity of MTSP20 expressed on endothelial 
cells, and includes an antibody or a fragment or derivative thereof containing the 
binding region thereof against the MTSP20 expressed on endothelial cells, an 
antisense nucleic acid encoding the MTSP20, and a nucleic acid containing at 
least a portion of a gene encoding the MTSP20. which is expressed on 

20 endothelial cells, into which a heterologous nucleotide sequence has been 

inserted such that the heterologous sequence inactivates the biological activity 
of at least a portion of the gene encoding the MTSP20, which is expressed on 
endothelial cells, in which the portion of the gene encoding the MTSP20 flanks 
the heterologous sequence so as to promote homologous recombination with a 

25 genomic gene encoding the MTSP20. 

The undesired or aberrant angiogenesis to be treated or prevented is 
associated with solid neoplasms, vascular malformations and cardiovascular 
disorders, chronic inflammatory diseases and aberrant wound repairs, circulatory 
disorders, crest syndromes, dermatological disorders, or ocular disorders. 

30 Vascular malformations and cardiovascular disorders to be treated or prevented 
include angiofibroma, angiolipoma, atherosclerosis, restenosis/reperfusion injury, 
arteriovenous malformations, hemangiomatosis and vascular adhesions, 
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dyschondroplasia with vascular hamartomas (Fafucci's syndrome), hereditary 
hemorrhagic telangiectasia (Rendu-Osler-Weber syndrome), or Von Hippie Lindau 
syndrome; the chronic inflammatory diseases to be treated or prevented are 
diabetes mellitus, hemophiliac joints. Inflammatory bowel disease, nonhealing 
5 fractures, periodontitis (rapidly progressing and juvenile), psoriasis, rheumatoid 
arthritis, venous stasis ulcers, granulations-burns, hypertrophic scars, liver 
cirrhosis, osteoradionecrosis, postoperative adhesions, pyogenic granuloma, or 
systemic sclerosis; the circulatory disorder to be treated or prevented is 
Raynaud's phenomenon; the crest syndromes to be treated or prevented are 

10 calcinosis, esophageal, dyomotiloty, sclerodactyly and teangiectasis; the 
dermatological disorders to be treated or prevented are systemic vasculitis, 
scleroderma, pyoderma gangrenosum, vasculopathy, venous, arterial ulcers, 
Sturge-Weber syndrome. Port-wine stains, blue rubber bleb nevus syndrome, 
Klippel-Trenaunay-Weber syndrome or Osler-Weber-Rendu syndrome; and the 

1 5 ocular disorders to be treated or prevented are blindness caused by ocular 

neovascular disease, corneal graft neovascularization, macular degeneration in 
the eye, neovascular glaucoma, trachoma, diabetic retinopathy, myopic 
degeratton, retinopathy of prematurity, retrolental fibroplasia, or corneal 
neovascularization. 

20 6. Treatment of deficient anglogenesis 

In another specific embodiment, a method for treating or preventing a 
disease or disorder associated with deficient angiogenesis in a mammal is 
provided herein by administering to a mammal an effective amount of an 
MTSP20 or a derivative or analog of the protein that is active in promoting 

25 angiogenesis, or a nucleic acid encoding the protein or a derivative or analog of 
the protein that is active In promoting angiogenesis, whereby the disease or 
disorder Is treated or prevented. The MTSP20 or portion thereof or a derivative 
or analog of the protein can be administered to a subject, such as a mammal, 
Including a human, with a pharmaceutically acceptable carrier or excipient. 

30 The treatment or prevention methods can further include administering a 

pro-angiogenic treatment or agent. The treatment can be used to promote 
physiological angiogenesis, particularly angiogenesis involved in normal 
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placentai, embryonic, fetal and post-natal development and growth, 
physiologically cyclical development in the ovarian follicle, corpus luteum and 
post-menstrual endometrium or wound healing. 

The pro-angiogenic agent used in the treatment can be, for example, a 
5 basic fibroblast growth factor such as bFGF and FGF-2, a vascular endothelial 
growth factor/vascular permeability factor such as VEGFA/PF and vasculotropin, 
a platelet-derived endothelial cell growth factor such as PO-EDGF and thymidine 
phosphorylase, a transforming growth factor-beta (TGF-y?), or angiopoietin-1 
(Ang-1|. 

10 M. Animal models 

Transgenic animal models and animals, such as rodents, including mice 
and rats, cows, chickens, pigs, goats, sheep, monkeys, including gorillas, and 
other primates, are provided herein. In particular, transgenic non-human animals 
that contain heterologous nucleic acid encoding an MTSP20 polypeptide or a 

1 5 transgenic animal in which expression of the polypeptide has been altered, such 
as by replacing or modifying the promoter region or other regulatory region of 
the endogenous gene are provided. Such an animal can by produced by 
promoting recombination between endogenous nucleic acid and an exogenous 
MTSP20 gene that could be over-expressed or mis-expressed, such as by 

20 expression under a strong promoter, via homologous or other recombination 
event. 

Transgenic animals can be produced by introducing the nucleic acid using 
any known method of delivery, including, but not limited to, microinjection, 
lipofection and other modes of gene delivery into a germline cell or somatic cells, 

25 such as an embryonic stem cell. Typically the nucleic acid is Introduced into a 
cell, such as an embryonic stem cell (ES), followed by injecting the ES cells into 
a blastocyst, and implanting the blastocyst into a foster mother, which is 
followed by the birth of a transgenic animal. Generally, introduction of a 
heterologous nucleic acid molecule into a chromosome of the animal occurs by a 

30 recombination between the heterologous MTSP20-encoding nucleic acid and 
endogenous nucleic acid. The heterologous nucleic acid can be targeted to a 
specific chromosome. 
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' In some instances, knockout animals can be produced. Such an animal 

can be Initially produced by promoting homologous recombination between an 
MTSP20 polypeptide gene in its chromosome and an exogenous MTSP20 
polypeptide gene that has been rendered biologically inactive (typically by 
5 insertion of a heterologous sequence, e.g., an antibiotic resistance gene). In one 
embodiment, this homologous recombination is performed by transforming 
embryo-derived stem (ES) cells with a vector containing the insertionally 

inactivated MTSP20 polypeptide gene, such that homologous recombination / 
occurs, followed by injecting the ES cells into a blastocyst, and implanting the 

10 blastocyst into a foster mother, followed by the birth of the chimeric animal 

("knockout animal") in which an MTSP20 polypeptide gene has been inactivated 
(see Capecchi, Science 2-^:1288-1292 (1989)). The chimeric animal can be 
bred to produce homozygous knockout animals, which can then be used to 
produce additional knockout animals. Knockout animals include, but are not 

15 limited to, mice, hamsters, sheep, pigs, cattle, and other non-human mammals. 
For example, a knockout mouse is produced. The resulting animals can serve 
as models of specific diseases, such as cancers, that exhibit under-expression of 
an MTSP20 polypeptide. Such knockout animals can be used as animal models 
of such diseases e.g., to screen for or test molecules for the ability to treat or 

20 prevent such diseases or disorders. 

Other types of transgenic animals also can be produced, including those 
that over-express the MTSP20 polypeptide. Such animals include "knock-in" 
animals that are animals in which the normal gene is replaced by a variant, such 
as a mutant, an over-expressed form, or other form. For example, one species', 

25 such as a rodent's endogenous gene can be replaced by the gene from another 
species, such as from a human. Animals also can be produced by non- 
homologous recombination into other sites in a chromosome; including animals 
that have a plurality of integration events. 

After production of the first generation transgenic animal, a chimeric 

30 animal can be bred to produce additional animals with over-expressed or mis- 
expressed MTSP20 polypeptides. Such animals include, but are not limited to, 
mice, hamsters, sheep, pigs, cattle and other non-human mammals. The 
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resulting animals can serve as models of specific diseases, such as cancers, that 
are exhibit over-expression or mis-expression of an MTSP20 polypeptide. Such 
animals can be used as animal models of such diseases e.g.. to screen for or 
test molecules for the ability to treat or prevent such diseases or disorders. In a 
5 specific embodiment, a mouse with over-expressed or mis-expressed MTSP20 
polypeptide is produced. 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 

10 EXAMPLE 1 

Identification of IVITSP20 

The protein sequence of the protease domain of endotheliasel (ET1; also 
called DESC1, accession number XP_003340) was used to search the human 
HTGS (High Throughput Genomic Sequence) database using the tbiastn 

15 algorithm (http://www.ncbi.nlm.nih.gov/BLAST). This search and alignment 
algorithm compares a protein query sequence against a nucleotide sequence 
database dynamically translated in all six reading frames (both strands); IVITSP20 
was identified. MTSP20 shares 39% identity to a region of ET1 between aa 188 
and aa 233, and 30% identity to a region of ET1 between aa 239 to aa 314. A 

20 search using the algorithm blastp (http://www.ncbi. nIm.nih.gov/BLAST ) 

indicates that the translated sequence of these 2 regions in MTSP20 has —33% 
identity to the region flanking the catalytic aspartate of prostamin (accession 
number AB030036) and MTSP6 (see, International PCT application No. (WO 
01/57194), and 45% identity with the region flanking the catalytic histidine of 

25 prostasin (accession number NP_002764). Based on the incomplete and 
unordered human genome sequence, MTSP20 appears to be localized on 
chromosome 16 (clone accession number AC009088). A search of sequences 
deposited in GenBank showed that no identical sequence had been deposited. 
Further search of. the human EST database showed 2 EST clones (IMAGE cDNA 

30 clone 3535196; GenBank accession number: BE264142 and IMAGE cDNA clone 
4932600; GenBank accession number: BG819317) were identical to a portion of 
the MTSP20 sequence. IMAGE clone 3535196 is a publicly available clone 
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derived from human small cell lung carcinoma, and this clone was purchased 
from Incyte Genomics (Palo Alto. CA) and sequenced. The sequence of this EST 
clone did not match the deposited sequence. Instead, it matched that of human 
Sck, a neuronal She adapter (GenBank accession number AB001451). The 
5 second publicly available EST clone (IMAGE 4932600) was derived from human 
oligodendroma, but was unavailable. 

The MTSP20 and encoding nucleic acid has homology to a clone 
described in International PCT application No. WO 02/00860 (see SEQ ID Nos. 
27 and 86 therein; and SEQ ID No. 17 herein). The clone and predicted 

10 encoded polypeptide described in the PCT application, however, differ from the 
nucleic acid molecule encoding MTSP20 pand the MTSP20 provided herein. The 
nucleic acid molecule includes nucleic acids encoding amino acids 624-642 
absent from the clone in International PCT application No. WO 02/00860. Full 
length polypeptides, polypeptides containing two protease domains and 

15 polypeptides containing MTSPP20 PD2 are provided herein. 
Cloning of MTSP20 from human lung tissue 

Using the genomic sequence of MTSP20, two gene specific 
oligonucleotide primers were designed. The sequence for the 5' end primer is 
5'-GAAGCAGCTCATCCTGCATGGAGCCTACAC-3' SEQ ID No. 7 and that of the 

20 3' end primer is 5'-GGCCAGGAACCATGTGCCCCTCACCTC-3' SEQ ID No. 8. 
These primers were used to amplify a cDNA fragment from human fetal lung 
tissue (part of Human Fetal Multiple Tissue cDNA Panel ; Clontech, Palo Alto, 
CA; catalog no. K 1425-1). Several bands were detected. A band (—400 bp) 
whose size closely matched that of the predicted, translated genomic sequence 

25 was Isolated by 2% agarose gel electrophoresis, purified using the MinElute gel 
extraction kit (catalog number 28606; http://www.qiagen.com), subcloned Into 
an £ CO// vector (pCR2.1T0P0; catalog no. K-4500-01; 
http://www.invitrogen.com) and transformed into £. co// TOPI 0 cells 
(http://www.invitrogen.com). Subsequent sequence analysis using a fluorescent 

30 dye-based DNA sequencing method (catalog number 4390244; ABI PRISM* 

BigDye™ Terminator v 3.0 Ready Reaction Cycle Sequencing Kits with AmpliTaq* 
DNA Polymerase, FS; http://home.appIiedbiosystems.com) showed that the 
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nucleotide sequence of this cDNA fragment matched that of the genomic 
MTSP20 exon sequences. 

5'-RACE and gene-specific PCR amplification of cDNA encoding full-length 
protease domain of IVITSP20 

5 Using the derived cDNA sequence of MTSP20, 2 oligonucleotide primers 

hybridizing within the protease domain sequence were designed and 

synthesized. The sequence for the 5' end primer is 

5'-GCTGCTTCAGGCCCCACTCCTCCGGTCTGG-3' {SEQ ID No. 181, and that of 
the nested 5' end primer is 
10 5'-CACAGGCCAGCTGTCCCTGGTGCATCAGCC-3' (SEQ ID No, 19). The initial 
5'-RACE primer was used to amplify cDNA fragments from human lymph node 
Marathon-ready cDNA library (catalog number 7432-1; 

http://www.clontech.com). Following this reaction, a nested 5'-RACE reaction 
was performed using the nested primer. Several cDNA bands were detected in 

15 ail RACE reactions. The cDNA fragments were separated by 2% agarose gel 
electrophoresis, blotted onto a nylon membrane (Supercharge Nytran; 
http://www.s-und-s.de/Pages-NEU-eng/UB3/Life%20Science/nytran.htm} and 
hybridized with the "P-labeled cDNA fragment described previously. 

Several cDNA bands hybridized with the probe. The 2 largest positive 

20 cDNA bands were purified from the nested 5'-RACE reaction using the MinElute 
gel extraction kit, subcloned into an £. co// vector (pCR2.1T0P0) and 
transformed into £. co// TOPI 0 cells. Several independent clones were cultured 
and used to isolate plasmtd DNA. Subsequent sequence analysis confirmed that 
the nucleotide sequence contained some of the missing 5 '-end sequence of 

25 MTSP20. The 5'-RACE product did not extend to the beginning of the cDNA as 
the methionine start codon was missing. 

To obtain the cDNA fragment encoding the rest of the 5' end of MTSP20, 
a gene-specific PCR amplification using a cDNA library prepared from human 
aortic endothelial cells (HAEC-aortic endothelial cells; catalog number CC-2635; 

30 Cionetics, http://www.cambrex.com) was used. Two 5'-end primers were used: 



5'-AGCACGGCCGGCCAATCGCCGAGTCAGAGG-3' (SEQ ID No. 20) and 
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B'-GCCTCGGGCGGAACCTGGAGATA ATG GGC -3' (SEQ ID No. 21) (underlined 
sequence represents the putative start codon). The 1" 5'-end primer was about 
95-bp upstream of the 2"^ 5'-end primer and located in the 5'-UTR of MTSP20. 
For both PCR reactions, a single''MTSP20-specific 3'-end primer was used: 
5 5'-GACCAGGAATTCAGTTCTGTTGCTGCTGCC-3' (SEQ ID No. 22). The 3'-end 
primer corresponds to the sequence found in the S'-RACE product isolated and 
sequenced previously, and it occurs immediately downstream of the catalytic 
histidine. 

Two cDNA fragments ( — 600-bp and 510-bp) were isolated from the PCR 
10 reaction using the 1*' 5-end and 3'-end primer pair, and 2™" 5-end and 3'-end 
primer pair, respectively. The PCR products were isolated, purified using the 
MinElute gel extraction kit, subcloned into an f. co// vector (pCR2.1TOPO) and 
transformed into E. co// TOP 10 cells. Several independent clones were cultured 
and used to prepare plasmid DNA. Subsequent sequence analysis confirmed 
15 that the nucleotide sequence contained the missing 5'-end of MTSP20 including 
part of the 5'-UTR. 

Gene expression profile of MTSP20 in normal tissues, tumor tissues, endothelial 
cells and tumor cell lines 

To obtain information regarding the gene expression profile of the 

20 MTSP20 transcript, the MTSP20 cDNA fragment {-400 bp) obtained from 

human lung tissue was used to probe a dot blot composed of RNA extracted 

from 76 different human tissues (Human Multiple Tissue Expression (MTE) Array; 

Clontech, Palo Alto, CA; catalog no. 7775-1; http://www.clontech.com). The 

results indicate that MTSP20 is expressed in many tissues. The MTSP20 

25 transcript is found in liver, lymph node, cerebellum, pancreas, prostate, uterus, 

testis, glands (adrenal, thyroid and salivary), thymus, kidney and spleen. Lower 

transcript level is found in lung, placenta, bladder, ovary, digestive system, 

circulatory system and parts of the brain. 

To compare the expression profile of MTSP20 transcript in a range of 

30 normal human and matched tumor tissues, a matched tumor/normal expression 

array (catalog number 7841-1; http://www.clontech.com) composed of 241 

paired cDNA samples from individual patients was used. The membrane was 
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hybridized with the same ^^P-labeled cDNA fragment as described before. 
Results show that the MTSP20 transcript is expressed at a low level in a number 
of normal tissues including breast, prostate, cervix, uterus, ovary, colon, lung, 
small intestine, stomach, kidney, pancreas, thyroid and rectum, but is not 
5 differentially expressed in any of the matched tumors. MTSP20 is also 

expressed in certain tumor cell lines including lung carcinoma (AStS), colorectal 
carcinoma (SW480), lymphoma (Raji and Daudi), cervical carcinoma (HeLaS3) 
and leukemia (HL-60, K-562 and MOLT-4) cell lines. 
Reverse tra nscript ion-PC R using two gene-specific primers 

10 RT-PCR using 2 gene-specific primers (SEQ ID Nos. 9 and 10): 

5'-GAAGCAGCTCATCCTGCATGGAGCCTACAC-3' for the sense primer and 
5'-GGCCAGGAACCATGTGCCCCTCACCTC-3' for the anti-sense primer) showed 
the presence of MTSP20 in normal liver, normal lymph node, breast carcinoma 
cell line (MDA-MB-231) and in prostate carcinoma cell lines (LNCaP, PC-3 and 

15 CWR22R). This primer set amplifies a ~380-bp cDNA fragment of MTSP20. 

Using the same set of primers, several endothelial cell cDNA libraries 
were prepared from polyA^RNA purified from primary endothelial cell cultures 
(see, e.g., Clonetics; http://www.cambrex.com) and screened for the presence 
of the MTSP20 cDNA. Human aortic endothelial cells, human iliac artery 

20 endothelial cells, human lung microvascular endothelial cells and human umbilical 
cord endothelial cells alt showed the presence of similar levels of MTSP20 
cDNA. Human uterus microvascular endothelial cells also showed the presence 
of the MTSP20 cDNA at a lower level- To confirm the expression of MTSP20 by 
endothelial cells, another set of primers was used for PGR amplification: 5'- 

25 AAGCCTCAGGAGGGCAACACAGTCCCTG-3' (SEQ ID No. 23) for the sense 
primer and 

5'-ACAGCTGGGCAGCTCACCCACAGCACTGG-3' (SEQ ID No. 24) for the 
antisense primer. This primer set amplifies a larger, ^1 .5 kbp fragment of 
MTSP20 cDNA. PGR reactions using as templates cDNA libraries prepared from 
30 human umbilical cord endothelial cells, human itiac artery endothelial cells, 

human aortic endothelial cells, human lung microvascular endothelial cells and 
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human dermal microvascular endothelial cells produced the expected MTSP20 
cDNA fragment. 

Domain structures of MTSP20 and homology to other serine proteases 

Sequence and protein domain prediction analyses (see, e.g., 
5 http://smart.embl-heidelberg.de) of the translated MTSP20 protease domain 
sequence indicated that IS/ITSP20 has a signal peptide (aa 1 to aa 24), a 
transmembrane domain (aa 82 to aa 99) and two potential N-glycosylation sites 
(...N77QS... and ...Nq2iIS...). MTSP20 has two trypsin-like serine protease 
domains in tandem, hereafter referred to as MTSP20-PD1 (aa 113 to aa 343) 
10 and IV)TSP20-PD2 (aa 375 to aa 596). Both protease domains are characterized 
by the presence of a protease activation cleavage site at the beginning of the 
domain and the catalytic triad residues (histidine, aspartate and serine) in 3 
highly-conserved regions of the catalytic domain. 

MTSP20-PD1 has the following features: a protease cleavage site (...K^,2 
15 * PiiaQEGN..., where I indicates the cleavage site); catalytic triad residues 

(H,52, D203 and Sjgg); the following cysteine pairings within the protease domain 
(C 

137*^153' ^233*^305' ^zsat^zq^ ^^d C295'C324); an unpaired cysteine (C219) in the 

protease domain is predicted to pair with C^o3- 

MTSP20-PD2 has the following features: the protease cleavage site 

20 (...R375 i T376AGPQ...); catalytic triad residues {H^^^, D457 and 8553); the following 

cysteine pairings within the protease domain (0401-04,7, Csig-Cgjg and C549-C577); 

the unpaired cysteine (C477) in the protease domain is predicted to pair with C^jy. 

Alignment iblastP '. http://www.ncbl.nlm.nih.gov/BLAST ) of MTSP20-PD1 to 
protein sequences deposited in the NCBI protein database 

25 (http://www.ncbi.nlm.nih.g0v/blast/html/blastcgihelp.html#protein_databases) 
showed the following homology: marapsin (35% identity; accession number 
NP_1 14154.1); prostasin (33% identity); prostamin (31% identity; accession 
number AB030036); matriptase (31% identity, accession number NP_068813); 
serine protease PRSS22 (32% identity, accession number NM_0221 19); 

30 MTSP6/TADG12 (31%) and hepsin (32% identity, accession number 

NIV]_002151). Alignment of the protease domain sequence of MTSP20-PD2 
with that of gamma-li tryptase (accession number API 95508) and 
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transmembrane tryptase {accession number XP_008123) showed 37% and 36% 
identity, respectively. MTSP20-PD2 also shared homology to several other 
serine proteases including DESCl {33% identity); MTSP6/TAGD12 (33% 
identity); prostasin (33% identity); prostannin (33% identity); matriptase (32% 
5 identity); airway trypsin-like protease (31% identity; accession number 
NP_004253) and coagulation factor XI (30% identity, accession number 
M20218). Blast search of the coding region of MTSP20 against the sequences 
found in the Incyte database showed an apparent full-length clone {959903. FL1) 
that had a 1,661 -bp insert. Clustal W alignment (using MacVector version 

10 6.5.3; http://www.accelry3.com/products/macvector/index.htmi) of the nucleic 
acid and protein sequences of MTSP20 to the sequences derived from the Incyte 
clone showed 59% and 40% identity, respectively. 
Sequence analysis 

MTSP20 cDNA and protein sequences were analyzed using MacVector 

15 (version 6.5.3; http://www.accelrys.com/products/macvector/index.html. The 

full-length cDNA encoding MTSP20 is 2,066 bp long. The coding region is 

1,929 bp long which translates to a 642-amino acid protein as follows (see SEQ 

ID Nos. 15 and 16). MTSP20 cDNA and translated protein sequences: 

Sequence Range: 1 to 2066 
20 10 20 30 40 50 60 

AGCACGGCCGGCCAATCGCCGAGTCAGAGGGCCAGGAGGGGCGCGGCCATTCGCCGCCCG 
TCGTGCCGGCCGGTTAGCGGCTCAGTCTCCCGGTCCTCCCCGCGCCGGTAAGCGGCGGGC 
70 80 90 100 110 120 

GCCCCTGCTCCGTGGCTGGTTTTCTCCGCGGGCGCCTCGGGCGGAACCTCGAGATAATGG 
2 5 CGGGGACGAGGCACCGACCAAAAGAGGCGCCCGCGGAGCCCGCCTTGGACCTCTATTACC 

M 

130 140 150 160 170 180 

GCAGCACCTGGGGGAGCCCTGGCTGGGTCCGGCTCGCTCTTTGCCTGACGGGCTTAGTGC 
CGTCGTGGACCCCCTCGGGACCGACCCACGC CGAGCGAGAAACGGACTGCCCGAATCACG 
30 GSTWGSPGWVRLALCLTGLV 

190 200 210 220 230 240 

TCTCGCTCTACGCGCTCCACGTGAAGGCGGCGCGCGCCCGGGACCGGGATTACCGCGCGC 
AGAGCGAGATGCGCGACGTGCACTTCCGCCGCGCGCGGGCCCTGGCCCTAATGGCGCGCG 
LS LYALHVKAARARDRDYRA 
35 250 260 270 280 290 300 

TCTCCGACGTGGGCa.CCGCCATCAGCTGTTCGCGCGTCTTCTCCTCCAGGTGGGGCAGGG 
AGACGCTGCACCCGTGGCGGTAGTCGACAAGCGCGCAGAAGAGGAGGTCCACCCCGTCCC 
LCDVGTAISCSRVFSSRWGR 

310 320 330 340 350 360 

40 GTTTCGGGCTGGTGGAGCATGTGCTGGGACAGGACAGCATCCTCAATCAATCCAACAGCA 

CAAAGCCCGACCACCTCGTACACGACCCTGTCCTGTCGTAGGAGTTAGTTAGGTTGTCGT 
GFGLVEHVLGQDS I LNQSNS 

370 380 390 400 410 420 

TATTCGGTTGCATCTTCTACACACTACAGCTATTGTTAGGTCTTCAAGC CG CTCAGCGTG 
45 ATAAGCCAACGTAGAAGATGTGTGATGTCGATAACAATCCAQAAGTTCGGCQAGTCGCAC 

IFGCI FYTLQLLLGLQAAQR 

430 440 450 460 470 4S0 
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CCTGTGGA CAGCGTGGCCCCGGCCCC CCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCG 
GGACACCTGTCGCACCGGGGCCGGGGGGGTTCGGAGTCCTCCCGTTGTGTCAGGGACCGC 
ACGQRGPGPPKPQEGNTVPG 

490 500 510 520 530 540 

A6TGGCCCTGGCAGGCCAGTGTGAGGAGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGG 
TCACCGGGACCGTC CGGTCACACTCCTCCGTTC CTCGGGTGTAGACGTCGCCGAGGGACC 
EWPWQASVRRQGAHI CSGSL 

550 5S0 570 580 590 • 600 

TGGCAGACACCTCGGTCCTCACTGCTGCCCACTGCTTTGAAAAGGCAGCAGCAACAGAAC 
ACCGTCTGTGGACCCAGGAGTGACGACGGGTGACGAAACTTTTCCGTCGTCGTTGTCTTC 
VADTWVLTAAHCFEKAAATE 

610 620 630 640 650 660 

TGAATTCCTGGTCAGTGGTCCTGGGTTCTCTGCAGCGTGAGGGACTCAGCCCTGGGGCCG 
ACTTAAGGACCAGTCACCAGGACCCAAGAGACGTCGCACTCCCTGAGTCGGGACCCCGGC 
LH5WSVVLGSLQREGLSPGA 

S70 680 690 700 710 720 

AAGAGGTGGGGGTGGCTGCCCTGCAGTTGCCCAGGGCCTATAACCACTACAGCCAGGGCT 
TTCTCCACCCCCACCGACGGGACGTCAACGGGTCCCGGATATTGGTGATGTCGGTCCCGA 
EEVGVAALQLPRAYNHYSQG 

730 740 750 760 770 780 

CAGACCTGGCCCTGCTGCAGCTCGCCCACCCCACOACCCACACACCCCTCTGCCTGCCCC 
GTCTGGACCGGGACGACQTCGAGCGGGTGGGGTGCTGGGTGTGTGGGGAGACGGACGGGG 
SDLALLQLAHPTTHTPLCLP 

790 800 810 820 830 840 

AGCCCGCC C ATCGCTTCCCCTTTGGAGCCTCCTGCTGGGC CACTGG CTGGGATCAGGACA 
TCGGGCGGGTAGCGAAGGGGAAACCTCGGAGGACGACCCGGTGACCGACCCTAGTCCTGT 
QPAHRFPFGASCWATGWDQD 

650 860 870 680 890 900 

CCAGTGATGCTCCTGGGACCCTACGCAATCTGCGCCTGCGTCTCATCAGTCGCCCCACAT 
GGTCACTACGAGGACCCTGGGATGCGTTAGACGCGGACGCAGAGTAGTCAGCGGGGTGTA 
TSDAPGTLRNLRLRLl SRPT 

910 920 930 940 950 960 

GTAACTGTATCTACAACCAGCTGCAC CAGCGACACCTGTCCAACC CGGCCCGGCCTGGGA 
CATTGACATAGATGTTGGTCGACGTGGTCGCTGTGGACAGGTTGGGCCGGGCCGGACCCT 
CNCIYNQLHORHLSNPARPG 

970 980 990 1000 1010 1020 

TGCTATGTGGGGGCCCCCAGCCTGGGGTGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCC 
ACGATACACCCCCGGGGGTCGGACCCCACGTCCCGGGGACAGTCCCTCTAAGGCCCCCGG 
MLCGGPQPGVQGPCQGDSGC 

1030 1040 1050 1060 1070 1080 

CTGTGCTGTGCCTCGAGCCTGACGGACACTGGGTTCAGGCTGGCATCATCAGCTTTGCAT 
GACACGACACGGAGCTCGGACTOCCTGTGACCCAAGTCCGACCGTAGTAGTCGAAACGTA 
PVLCLEPDGHWVQAGI ISFA 

1090 1100 1110 1120 1130 1140 

CAAGCTGTGCCCAGGAGGACGCTCCTGTGCTGCTGACCAACACAGCTG CTCACAGTTC CT 
GTTCGACACGGGTCCTCCTGCGAGGACACGACGACTGGTTGTGTCGACGAGTGTCAAGGA 
SSCAQEDAPVLLTNTAAHS S 

1150 1160 1170 1180 1190 1200 

GGCTGCAGGCTCGAGTTCAGGGGG CAGCTTTC CTGGCCC AGAGC CCAGAGACCCCGGAGA 
CCGACGTCCGAGCTCAAGTCCCCCGTCGAAAGGACCGGGTCTCGGGTCTCTGGGGCCTCT 
WLQARVQGAAFLAQSPETPE 

1210 1220 1230 1240 1250 1260 

TGAGTGATGAGGACAGCTGTGTAGCCTGTGGATCCTTGAGGACACCAGGTCCCCAGGCAG 
ACTCACTACTCCTGTCGACACATCGGACACCTAGGAACTCCTGTCGTCCAGGGGTCCGTC 
MSDEDSCVACGSLRTAGPQA 

1270 1280 1290 1300 1310 1320 

GAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCTGATGCACCAGGGACAGCTGGCCTGTG 
CTCGTGGGAGGGGTACCGGGACCCTCCGGTCCGACTACGTGGTCCCTGTCGACCGGACAC 
GAPSPWPWEARLMHQGQLAC 

1330 1340 1350 1360 1370 1380 

GCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTGCTGCCCACTGCTTCATTGGGCGCC 
CGCCTCGGGACCACAGTCTCCTCCGCCACGATTGACGACGGGTCACGAAGTAACCCGCGG 
GGALVSEEAVLTAAHCFIGR 

1390 1400 1410 1420 1430 1440 

AGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGACCGGAGGAGTGGGGCCTGAAGC 
TCCGGGGTCTCCTTACCTCG CATC CCGACCC CTGGTCTGGCCTCCTCACCCCGGACTTCO 
QAPEEWSVGIiGTRPEEWGLK 

1450 1460 1470 1480 1490 1500 

AGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGGCTACGACATGGCCCTCCTGC 
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TCGAGTAGGACGTACCTCGGATGTGGGTGGGACTCCCCCCGATGCTGTACCGGGAGGACG 
QLI LHGAYTHPEGGYDMALL 

X510 1520 1530 1540 1550 1560 

TGCTGGCCCAGCCTGTGACACTGGGAGCCAGC CTG CGGCCCCTCTG CCTG CCCTATCCTG 
5 ACGACCGGGTCGGACACTGTGACCCTCGGTCGGACGCCGGGGAGACGGACGGGATAGGAC 
LLAQPVTLGASLRPLCLPyP 

1570 1580 1590 1600 1610 1620 

ACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGGGCCCeCCCAGGAGCAG 
TGGTGGTGGACGGACTACCCCTCGCACCGACCCAAGACCCTGCCCGGGCGGGTCCTCGTC 
10 DHHLPDGERGWVLGRARP6A 

1630 1640 1650 1660 1670 1680 

GCATCAGCTCCCTCCAGACAGTGCCCCTGACCCTCCTGGGGCCTAGGGCCTGCAGCCGGC 
CGTAGTCGAGGGAGGTCTGTCACGGGCACTGGGAGGACCCCGGATCCCGGACGTCGGCCG 
GISSLOTVPVTLLGPRACS R 
15 1690 1700 1710 1720 1730 1740 

TGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGGTGTCTACCAGTG 
ACGTACGTCGAGGACCCCCACTACCGTCGGGATAAGACGGCCCCTACCACACATGGTCAC 
LHAAPGGDGSPILPGMVCTS 

1750 1760 1770 1780 1790 1800 

2 0 CTGTGGGTGAGCTGCC CAGCTGTGAGGGCCTGTCTGGGGCACCACTGGTGCATGAGGTGA 

GACACCCACTCGACGGGTCGACACTCCCGGACAGACCCCGTGGTGACCACGTACTCCACT 
AVGELPSCEGLSGAPLVHEV 

IBIO 1820 1830 1840 1850 1860 

GGGGCACATGGTTCCTGGCCGGGCTGCACAGCTTCGGAGATGCTTGCCAAGGCCCCGCCA 
25 CCCCGTGTACCAAGGACCGGCCCGACGTGTCGAAGCCTCTACGAACGGTTCCGGGGCGGT 

RGTWFLAGLHSFGDACQGPA 

1870 1880 1B90 1900 1910 1920 

GGCCGGCGGTCTrCACCGCGCTCCCTGCCTATGAGGACTGGGTCAGCAOTTTGGACTGGC 
CCGGCCGCCAQAAGTGOCGCGAGGGACGGATACTCCTGACCCAGTCGTCAAACCTGACCG 
30 RPAVFTALPAYEDWVSSLDW 

1930 1940 1950 1960 1970 1960 

AGGTCTACTTCGCCGAGGAACCAGAGCCCGAGGCTGAGCCTGGAAGCTGCCTGGCCAACA 
TCCAGATGAAGCGGCTCCTTGGTCTCGGGCTCCGACTCGGACCTTCGACGGACCGGTTGT 
QVYFAEEPEPEAEPGSCLAN 
35 1990 2000 2010 2020 2030 2040 

TAAGTATGTGGCCCCGGGGCCTCCTGCCAAACCCTGCCTCTCCAGGACCCTTCTCTCTCC 
ATTCATACACCGGGGCCCCGGAGGACGGTTTGGGACGGAGAGGTCCTGGGAAGAGAGAGG 
ISMWPRGLIjPNPASFGPFSL 

2050 2060 
40 AGTGAGAACGTGCCGTTCTAACCCAC 

TCACTCTTGCACGGCAAGATTGGGTG 
Q * 

MTSP20 protein sequence 
45 Sequence Range: 1 to 64 3 

10 20 30 40 50 60 

MGSTWGS PGWVRLALCLTGLVLSLYALHVKAARARDRDYRALCDVGTAI SC S RVFSSRWG 
70 80 90 100 110 120 

RGFGLVEHVLGQDS I LNQSNS I FGCI F YTLQLLLGLQAAQRACGQRGPG P P KPQEGNTVP 
50 130 140 150 160 170 180 

GE WPWQAS VRRQGAH I CS GS LVADTWVLTAAHCFEKAAATELNS WS WLG SLQREGLS PG 
190 200 210 220 230 240 

AE EVGVAALQLPRAYNHYS0GSDI*AL.LQI*AH PTTHTPLCL PQPAHRFP FGAS CWATGWDQ 
250 260 270 280 290 300 

5 5 DTSDAPGTLRNLRLRLISHPTCNCIYNQLHQRHLSNPARPGHLCGGPQPGVOGPCQGDSG 

310 320 330 340 350 360 

GPVIiCLEPDGHWVQAGII S FAS S CAQEDAPVL LTNTAAH S S WLQARVQGAAFLAQS P ETP 

370 380 390 400 410 420 

EMSDEDSCVACGSLRTAGPQAGAPSPWPWEARLMHQGOLACGGALVSEEAVLTAAHCFIG 
60 430 440 450 460 470 480 

RQAPEEWSVGLGTRPEEWGLKQblLHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPY 

490 500 510 520 530 540 

PDHHLPDGERGWVLGRAR PGAGI SSLQTVPVTLLGPRACSRIiHAAPGGDGS PILPGMVCT 

SSO 560 570 580 590 600 

6 5 SAVGELPSCEGLSGAPLVHEVRGTWFLAGLHSFGDACQGPARPAVFTALPAYEDWVSSLD 

610 620 630 640 

WQVY FAEEPEPEAE PGS CLAN I SMW P RGLL PNPASPG P FS LQ ♦ 
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EXAMPLE 2 

Expression of the protease MTSP domains 

Nucleic acid encoding the MTSP20 and protease domain thereof 
can be cloned into a derivative of the Pichia pastoris vector pPIC9K 
5 (available from Invitrogen; see SEQ ID NO. 13). Plasmid pPICSk features 
include the 5' AOXl promoter fragment at 1-948; 5' A0X1 primer site at 
855-875; alpha-factor secretion signal(s) at 949-1218; alpha-factor 
primer site at 1 1 52-1 1 72; multiple cloning site at 1 1 92-1 241 ; 3' AOXl 
primer site at 1327-1347; 3' AOXl transcription termination region at 

10 1253-1586; HIS4 ORF at 4514-1980; kanamycin resistance gene at 
5743-4928; 3' AOXl fragment at 6122-6879; ColEI origin at 
7961-7288; and the ampicillin resistance gene at 8966-8106. The 
plasmid used herein is derived from pPIC9K by eliminating the Xhol site in 
the kanamycin resistance gene and the resulting vector is herein 

15 designated pPIC9KX. Expression in Pichia can be performed using known 
methods (see, e.g., Zhang et al. (2000) Biotechnology and Bioengineehng 
70: No 1 October 5, 2000), 

Mutagenesis of a Protease Domain of MTSP20 for Expression In Pichia 

Each or both protease domains can be mutated so that the 
20 unpaired Cys is replaced with another amino acid, such as Ser, to 
decrease or prevent aggregation of the expressed polypeptide. 

EXAMPLE 3 

Assays for identification of candidate compounds that modulate that 
activity of an MTSP 

25 Assay for screening MTSP20 Inhibitors 

A protease domain (or plurality thereof) of MTSP20 expressed in 
Pichia pastoris is assayed for inhibition by various compounds as follows 
in Costar 96 well tissue culture plates (Corning NY). Approximately 1-10 
nM MTSP20 is added without inhibitor, or with 100000 nM inhibitor and 
30 7 1:6 dilutions to IX direct buffer (29.2 mM Tris, pH 8.4, 29.2 mM 
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10 



Imidazole, 217 mM NaCI (100 //I final volume)), and allowed to incubate 
at room temperature for 30 minutes. 400 substrate Spectrozyme t- 
PA (American Diagnostica,, Greenwich, CT) is added and reaction is 
monitored in a SpectraMAX Plus rfiicroplate reader (Molecular Devices, 
Sunnyvale CA) by following change in absorbance at 405 nm for 20 
minutes at 37 °C. Spectrozyme UK can also be used as the substrate in 
this assay. 

Identification of substrates 

Other substrates for use in the assays can be identified empirically 
by testing substrates. The following list of substrates are exemplary of 
those that can be tested. 



Substrate name 


Structure 


S 2366 


pyroGlu-Pro-Arg-pNA.HCI 


spectrozyme t-PA 


CHaSOj-D-HHT-Gly-Arg-pNA.AcOH 


N-p-tosyl-Gly-Pro-Arg-pNA 


N-p-tosyl-Gly-Pro-Arg-pNA 


Benzoyl-Val-Gly-Arg-pNA 


Benzoyl-Val-Gly-Arg-pNA 


Pef a chrome t-PA 


CHaSOj-D-HHT-Gly-Arg-pNA 


S 2765 


N-a-Z-D-Arg-Gly-Arg-pNA.2HCI 


S 2444 


pyroGlu-Gly-Arg-pNA.HCI 


8 2288 


H-D-lle-Pro-Arg-pNA.2HCI 


spectrozyme UK 


Cbo-L-(K)Glu(ff-t-BuO)-Glv-Arg-pNA.2AcOH 


S 2302 


H-D-Pro-Phe-Arg-pNA.2HCI 


S 2266 


H-D-Vat-Leu-Arg-pNA.2HCI 


S 2222 


Bz-lle-Glu(g-OR)-Glv-Arg-pNA.HCI 
R = H(50%) and R = CH3(50%) , 


Chromozyme PK 


Benzoyl-Pro-Phe-Arg-pNA 


S 2238 


H-D-Phe-Pip-Arg-pNA.2HCI 


S 2251 


H-D-Val-Leu-Lys-pNA.2HCI 


Spectrozyme PI 


H-D-Nle-HHT-Lys-pNA.2AcOH 




Pyr-Arg-Thr-Lys-Arg-AMC 




H-Arg-Gtn-Arg-Arg-AMC 




Boc-Gln-Gly-Arg-AMC 




^-Arg-Arg-AMC 


Spectrozyme THE 


H-D-HHT-Ala-Arg-pNA.2AcOH 


Spectrozyme fXlla 


H-D-CHT-Gly-Arg-pNA.2AcOH 




CVS 2081-6 (MeSGz-dPhe-Pro-Arg-pNA) 



15 



20 



25 



30 



35 
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I [Pefachrome fVlla (CHaSOg-D-CHA-But-Arg-pNA) [ 

pNA - para-nitranilide (chromogenic) 

AMC = amino methyl coumarin (fluorescent) 

If none of the above substrates are cleaved, a coupled assay can be used. 

5 Briefly^ a coupled assay tests the ability of the protease to activate an enzyme, 

such as plasminogen and trypsinogen. To perform these assays, a single chain 

protease domain (or plurality thereof) is incubated with a zymogen, such as 

plasminogen or trypsinogen, in the presence of the a known substrate, such as a 

chromogenic substrate (i.e., S2366, see TABLE above) for the zymogen. If the 

10 single chain activates the zymogen, the activated enzyme, such as plasmin and 

trypsin, will degrade the substrate therefor {i.e., S2366). 

EXAMPLE 4 

Other Assays 

These assays are described with reference to MTSP1, but such assays 

15 can be readily adapted for use with MTSP20. 

Amidolytic Assay for Determining Inhibition of Serine Protease 
Activity of iVIatriptase or MTSP1 

The ability of test compounds to act as inhibitors of rMAP catalytic 
activity is assessed by determining the inhibitor-induced inhibition of amidolytic 
20 activity by the MAP, as measured by IC^q values. The assay buffer is HBSA (10 
mM Hepes, 150mM sodium chloride, pH 7.4, 0.1% bovine serum albumin). All 
reagents are from Sigma Chemical Co. (St. Louis, MO), unless otherwise 
indicated. 

Two ICgo assays (a) one at either 30-minutes or 60-minutes (a 30-minute 
25 or a 60-minute preincubation of test compound and enzyme) and (b) one at 

0-minutes (no preincubation of test compound and enzyme) is conducted. For 
the IC50 assay at either 30-minutes or 60-minutes, the following reagents are 
combined in appropriate wells of a Corning microtiter plate: 50 microliters of 
HBSA, 50 microliters of the test compound, diluted (covering a broad 
30 concentration range) in HBSA (or HBSA alone for uninhibited velocity 

measurement), and 50 microliters of the rMAP (Corvas International) diluted in 
buffer, yielding a final enzyme concentration of 250 pM as determined by active 
site filtration. Following either a 30-minute or a 60-minute incubation at ambient 
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temperature, the assay is initiated by the addition of 50 nnicroliters of the 
substrate S-2765 (N-a-Benzyloxycarbonyl-D-arginyl-L-glycyl-L-arginine-p- 
nitroaniline dihydrochloride; DiaPharnna Group, Inc.; Franklin, OH) to each well, 
yielding a final assay volume of 200 microliters and a final substrate 
5 concentration of lOO^yM (about 4-times KJ. Before addition to the assay 

mixture, S-2765 is reconstituted in deionized water and diluted in HBSA. For the 
IC50 assay at 0 minutes; the same reagents are combined: 50 microliters of 
HBSA, 50 microliters of the test compound, diluted (covering the identical 
concentration range) in HBSA (or HBSA alone for uninhibited velocity 

10 measurement), and 50 microliters of the substrate S-2765. The assay is 

initiated by the addition of 50 microliters of rMAP. The final concentrations of 
all components are identical In both assays (at 30- or 60- and 0-minute). 

The initial velocity of chromogenic substrate hydrolysis is measured in 
both assays by the change of absorbance at 405 nM using a Thermo Max® 

15 Kinetic Microplate Reader (Molecular Devices) over a 5 minute period, in which 
less than 5% of the added substrate is used. The concentration of added 
inhibitor, which caused a 50% decrease in the initial rate of hydrolysis is defined 
as the respective IC^q value in each of the two assays (30- or 60-minutes and 
0-minute). 

20 /n vitro enzyme assays for specificity determination 

The ability of compounds to act as a selective inhibitor of matriptase 
activity is assessed by determining the concentration of test compound that 
inhibits the activity of matriptase by 50%, (IC50) as described in the above 
Example, and comparing IC50 value for matriptase to that determined for all or 

25 some of the following serine proteases: thrombin, recombinant tissue 

plasminogen activator (rt-PA), plasmin, activated protein C, chymotrypsin, factor 
Xa and trypsin. 

The buffer used for all assays is HBSA (10 mM HEPES, pH 7.5, 150 mM 
sodium chloride, 0.1% bovine serum albumin). 
30 The assay for IC50 determinations is conducted by combining in 

appropriate wells of a Corning microliter plate, 50 microliters of HBSA, 50 
microliters of the test compound at a specified concentration (covering a broad 
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concentration range) diluted in HBSA (or HBSA alone lor Vq (uninhibited velocity) 
measurement), and 50 microliters of the enzyme diluted in HBSA. Following a 
30 minute incubation at ambient temperature, 50 microliters of the substrate at 
the concentrations specified below are added to the wells, yielding a final total 
5 volume of 200 microliters. The initial velocity of chromogenic substrate 

hydrolysis is measured by the change in absorbance at 405 nm using a Thermo 
Max® Kinetic MIcroplate Reader over a 5 minute period in which less than 5% of 
the added substrate is used. The concentration of added inhibitor which caused 
a 50% decrease in the initial rate of hydrolysis is defined as the ICgg value. 

10 Thrombin (file) Assay 

Enzyme activity is determined using the chromogenic substrate, 
Pefachrome t-PA (CHjSOj-D-hexahydrotyrosine-glycyl-L-Arginine-p-nitroaniline, 
obtained from Pentapharm Ltd.). The substrate is reconstituted in deionized 
water prior to use. Purified human o-thrombin is obtained from Enzyme 

15 Research Laboratories, Inc. The buffer used for all assays is HBSA (10 mM 
HEPES, pH 7.5, 150 mM sodium chloride, 0.1% bovine serum albumin). 

IC50 determinations are conducted where HBSA (50 fjL), o-thrombin (50 
//I) (the final enzyme concentration is 0.5 nM) and inhibitor (50 pi) (covering a 
broad concentration range), are combined in appropriate wells and incubated for 

20 30 minutes at room temperature prior to the addition of substrate 

Pefachrome-t-PA {BO fj\) (the final substrate concentration is 250 ;yM, about 5 
times Km). The initial velocity of Pefachrome t-PA hydrolysis is measured by the 
change in absorbance at 405 nm using a Thermo Max® Kinetic Microplate Reader 
over a 5 minute period in which less than 5% of the added substrate is used. 

25 The concentration of added inhibitor which caused a 50% decrease in the initial 
rate of hydrolysis is defined as the IC^o value. 
Factor Xa 

Factor Xa catalytic activity is determined using the chromogenic substrate 
S-2765 (N-benzyloxycarbonyl-D-arginine-L-glycine-L-arginine-p-nitroaniline), 
30 obtained from DiaPharma Group (Franklin, OH). All substrates are reconstituted 
in deionized water prior to use. The final concentration of S-2765 is 250 //M 
(about 5-times Km). Purified human Factor X is obtained from Enzyme Research 
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Laboratories, Inc. (South Bend, IN) and Factor Xa (FXa) is activated and prepared 
from it as described (Bock, P.E., Craig, P.A., Olson, ST., and Singh, P. Arch. 
Biochem. Biophys. 273:375-388 (19891). The enzyme was diluted into HBSA 
prior to assay in which the final concentration is 0.25 nM. 
5 Recombinant tissue plasminogen activator (rt-PA) Assay 

rt-PA catalytic activity is determined using the substrate, Pefachrome 
t-PA (CHjSOj-D-hexahydrotyrosine-glycyl-L-arglnine-p-nitroaniline, obtained from 
Pentapharm Ltd.). The substrate is made up in deionized water followed by 
dilution in HBSA prior to the assay in which the final concentration is 500 
10 micromolar (about 3-times Km). Human rt-PA (Activase®) is obtained from 

Genentech Inc. The enzyme is reconstituted in deionized water and diluted into 
HBSA prior to the assay in which the final concentration is 1 .0 nM. 
Plasmin Assay 

Plasmin catalytic activity is determined using the chromogenic substrate, 
15 S-2366 (L-pyroglutamyl-L-prolyl-L-arginine-p-nitroaniline hydrochloride), which is 
obtained from DiaPharma group. The substrate is made up in deionized water 
followed by dilution in HBSA prior to the assay in which the final concentration 
is 300 micromolar (about 2.5-times Km). Purified human plasmin is obtained 
from Enzyme Research Laboratories, Inc. The enzyme is diluted into HBSA prior 
20 to assay in which the final concentration is 1.0 nM. 

Activated Protein C (a PC) Assay 
aPC catalytic activity is determined using the chromogenic substrate, 
Pefachrome PC (delta-carbobenzloxy-D-tysine-L-prolyl-L-arginine-p-nitroaniline 
dihydrochloride, obtained from Pentapharm Ltd.). The substrate is made up in 
25 deionized water followed by dilution in HBSA prior to the assay in which the final 
concentration is 400 micromolar (about 3-times Km), Purified human aPC is 
obtained from Hematologic Technologies, Inc. The enzyme is diluted into HBSA 
prior to assay in which the final concentration is 1 .0 nM. 
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Chymotrypsin Assay 

Chymotrypsin catalytic activity is determined using the chromogenic 
substrate, S-2586 (methoxy-succinyl-L-arginine-L-prolyl-L-tyrosyl-p-nitroanilide), 
which is obtained from DiaPharma Group. The substrate is made up in deionized 
5 water followed by dilution in HBSA prior to the assay in which the final 

concentration is 100 mlcromofar (about 9-times Km). Purified OX-crystallized; 
GDI) bovine pancreatic alpha-chymotrypsin is obtained from Worthington 
Biochemical Corp. The enzyme is reconstituted in deionized water and diluted 
into HBSA prior to assay in which the final concentration is 0.5 nM. 

10 Trypsin Assay 

Trypsin catalytic activity is determined using the chromogenic substrate, 
S-2222 (benzoyl-L-isoleucine-L-glutamic acid-[gamma-methyl esterl-L- 
arginine-p-nitroanilide), which is obtained from DiaPharma Group. The substrate 
is made up in deionized water followed by dilution in HBSA prior to the assay in 

15 which the final concentration is 250 micromolar (about 4-times Km). Purified 
(3X-crystallized; TRL3) bovine pancreatic trypsin is obtained from Worthington 
Biochemical Corp. The enzyme is reconstituted in deionized water and diluted 
into HBSA prior to assay in which the final concentration is 0.5 nM. 

20 Since modifications will be apparent to those of skill in this art, it is 

intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A substantially purified single or multi-chain polypeptide, 
comprising a protease domain of a type-ll membrane-type serine protease 20 
{MTSP20) or a catalytically active portion thereof, wherein the polpeptide 

5 comprises two protease domains. 

2. A substantially purified polypeptide that comprises an MTSP20 
protease domain 2 (PD2) domain. 

3. A substantially purified polypeptide, that consists essentially of an 
MTSP20 protease domain 1 (PDI) domain. 

10 4. The polypeptide of claim 1 that is an activated two or three- chain 

polypeptide. 

6. A polypeptide of any of claims 1-4 selected from the group 
consisting of: 

a polypeptide that comprises a sequence of amino acids encoded 
15 by the sequence that includes at least about 60%, 70%, 80% or 90% 

amino acid sequence identity with the sequence of amino acids set forth 
in SEQ ID No. 6 or SEQ ID No. 16; 

a polypeptide that comprises a sequence of amino acids encoded 
by the sequence that includes at least about 60%, 70%, 80% or 90% 
20 amino acid sequence identity with the sequence of amino acids set forth 

In SEQ ID No. 6 or SEQ ID No. 6; 

a polypeptide that comprises a sequence of amino acids encoded 
by the sequence of nucleotides set forth in SEQ ID No. 5 or SEQ ID No. 
15; 

25 a polypeptide that comprises a sequence of amino acids encoded 

by a sequence of nucleotides that hybridizes along at least 70% of its 
full-length under conditions of high stringency to the sequence of 
nucleotides set forth as SEQ ID No. 5 or SEQ ID No. 15 or to a portion 
thereof that encodes at least one protease domain that includes amino 

30 acids 624-642 of SEQ ID No. 16; 
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a polypeptide that is encoded by a sequence of nucleotides that is 
a splice variant of the sequence of nucleotides that comprises the 
sequence set forth in SEQ ID No. 5 or SEQ ID No. 15. 

6. The substantially purified polypeptide of any of claims 1-5, 
5 wherein the MTSP20 is a human polypeptide. 

7. The substantially purified polypeptide of claim 3 that consists 
essentially of the PD2 of MTSP20 or a catalytically active portion of the the PD2 
protease domain of MTSP20. 

8. The substantially purified polypeptide of claim 1 that comprises the 
10 sequence of amino acids set forth as amino acids in SEQ ID No. 16. 

9. A substantially purified polypeptide of claim 2 that consists 
essentially of the sequence of amino acids set forth as residues 376-624 in SEQ 
ID No. 1 6 or catalytically active fragments thereof. 

10. The polypeptide of claim 9 that further includes one or more 

15 contiguous residues 625 through 642 of SEQ ID No. 16 following residue 624. 

11. A substantially purified polypeptide of claim 3 that consists 
essentially of the sequence of amino acids set forth in SEQ ID No. 6 or amino 
acids 1 13-343 of SEQ ID No. 16 or catalytically active fragments of the 
sequence of amino acids set forth in SEQ ID No, 6 or amino acids 1 13-343 of 

20 SEQ ID No. 16. 

12. The substantially purified polypeptide of claim 1 that has at least 
about 60%, 70%, 80% or 90% sequence identity with a polypeptide that 
comprises the sequence of amino acids set forth as SEQ ID No. 16, wherein the 
substantially purified polypeptide is a protease. 

25 13. A polypeptide of any of claims 1-4, wherein a protease domain 

portion is encoded by a nucleic acid molecule that hybridizes under conditions of 
at least low stringency along at least 70% of its full-length to a nucleic acid 
molecule comprising a sequence of nucleotides set forth in SEQ ID No. 5 or that 
encodes a catalytically active portion of a protease domain or that hybridizes 

30 under conditions of at least low stringency along at least 70% of its full-length 
to a nucleic acid molecule that encodes residues 376-642 of SEQ ID No. 16. or 
that encodes a catalytically active portion of a protease domain. 
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14. The polypeptide of claim 13, wherein the conditions of 
hybridization are high stringency conditions. 

15. A polypeptide of any of claims 1-4 that is a mutein, wherein: 
up to about 50% of the amino acids are replaced with another amino 

5 acid; 

and the resulting polypeptide is a single chain or multi-chain polypeptide 
that has catalytic activity of at least 1%, 5% or 10% of the unmutated 
polypeptide. 

16. The polypeptide of claim 15, wherein up to about 20% of the 
10 amino acids are replaced with another amino acid. 

17. The polypeptide of claim 1 5, wherein the resulting polypeptide is 
a single chain or multi-chain polypeptide and has catalytic activity of at least 1 % 
of the unmutated polypeptide. 

18. The polypeptide of claim 1 5, wherein: 

15 a cysteine in a protease domain is replaced with another amino acid; and 

the cysteine is one that is unpaired in a polypeptide that consists essentially of a 
protease domain. 

19. The polypeptide of claim 18, wherein the replacing amino acid is a 

serine. 

20 20. A nucleic acid molecule, comprising a sequence of nucleotides that 

encodes a polypeptide of any of claims 1-18. 

21 . The nucleic acid molecule of claim 20 that comprises a sequence 
of nucleotides selected from the group consisting of: 

(a) a sequence of nucleotides set forth as nucleotides 1-642 in SEQ ID 
25 No, 16 or set forth in SEQ ID No. 5 or a portion thereof that encodes a 

proteolytically active polypeptide; 

(b) a sequence of nucleotides that hybridizes under high stringency along 
its length or along at least about 70% of the full-length to the sequence of 
nucleotides in SEQ ID No. 5 or SEQ ID No. 1 5 or to a portion thereof that 

30 encodes a protease domain of an MTSP20; 

(c) a sequence of nucleotides that encodes the polypeptide of SEQ ID No. 
6 or of SEQ ID No. 16; 
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(d] a sequence of nucleotides that is a splice variant of (a), {b) or (c); 
te) a sequence of nucleotides that encodes a protease domain or a 
catalyticatly active portion thereof that includes a sequence of 
nucleotides having at least about 60%, 70%, 80%, 90% or 95% 
5 sequence identity with the sequence set forth in SEQ ID No. 5 or 

SEQ ID No. 15; and 
{f) a sequence of nucleotides comprising degenerate codons of (a), 
(bl, (cK (d) or (e). 

22. An isolated nucleic molecule that encodes a mutein polypeptide of 
10 claim 1 5. 

23. A vector comprising the nucleic acid molecule of claim 20. 

24. The vector of claim 23 that is an expression vector. 

25. The vector of claim 23 that is a eukaryotic vector. 

26. The vector of any of claims 23-25 that includes a sequence of 
15 nucleotides that directs secretion of any polypeptide encoded by a sequence of 

nucleotides operativety linked thereto. 

27. The vector of claim 26 that is a Pichia vector or an E coH vector. 

28. A cell, comprising the vector of claim 23. 

29. The cell of claim 28 that Is a prokaryotic cell. 
20 30. The cell of claim 28 that is a eukaryotic cell. 

31 . The cell of claim 28 that is selected from among a bacterial cell, a 
yeast cell, a plant cell, an insect cell and an animal cell. 

32. The cell of claim 28 that is a mammalian ceil. 

33. A recombinant non-human animal, wherein an endogenous gene 
25 that encodes a polypeptide of claim 1 has been deleted or inactivated by 

homologous recombination or insertiona) mutagenesis of a gene of the animal or 
an ancestor thereof. 

34. A method for producing a polypeptide that contains a protease 
domain of an MTSP20 polypeptide, comprising: 

30 culturing the cell of claim 28 under conditions whereby the encoded 

polypeptide is expressed by the cell; and 

recovering the expressed polypeptide. 
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35. The method of claim 34, wherein the polypeptide is secreted into 
the culture medium. 

36. The method of claim 34 or 35, wherein the cell is a Pichia cell. 

37. An isolated cell that comprises a polypeptide of claims 1-8 and 11- 
5 18 expressed on Its surface. 

38. An antisense nucleic acid molecule that comprises at least 14 
contiguous nucleotides or modified nucleotides that are complementary to a 
contiguous sequence of nucleotides encoding an MTSP20 of any of claims 1-18; 
or 

10 comprises at least 16 contiguous nucleotides or modified nucleotides that 

are complementary to a contiguous sequence of nucleotides encoding the an 
MTSP20 polypeptide of any of claims 1-18; or 

comprises at least 30 contiguous nucleotides or modified nucleotides that 
are complementary to a contiguous sequence of nucleotides encoding an 

15 MTSP20 polypeptide of any of claims 1-18. 

39. The antisense molecule of claim 38 that includes a contiguous 
sequence of nucleotides that is the complement of a sequence of nucleotides in 
SEQ ID No. 5 or SEQ ID No. 1 5. 

40. A double-stranded RNA (dsRNA) molecule that comprises at least 
20 about 21 contiguous nucleotides or modified nucleotides from a sequence of 

nucleotides encoding an MTSP20 of any of claims 1-lB. 

41 . An antibody that specifically binds to the polpeptide of claim 1 , 
but that does not bind to MTSP20 molecules encoded by splice variants lacking 
a full-tenth protease 2 domain. 

25 42. An antibody that binds to the polpeptide of claim 1 with at least 

10-fold greater affinity than to a polypeptide encoded by SEQ ID No. 17. 

43. An antibody of claim 42 that binds with at least 100-fold greater 
affinity. 

44. An antibody of any of claims 41-43 that inhibits an enzymatic 
30 activity of the polypeptide. 

45. An antibody that specifically binds to the protease 2 domain (PD2) 
of an MTSP20 polypeptide. 



wo 03/004681 



PCT/US02/21208 



-180- 



46. The antibody of any of claims 41-45 that binds to a single- 
chain fornn of an MTSP20 polypeptide or a PD1 or PD2 protease domain thereof. 

47. The antibody of any of claims 41-45 that is a monoclonal 
antibody. 

5 48. The antibody of any of claims 41-45 that is a polyclonal antibody. 

49. A conjugate, comprising: 

a polypeptide of any of claims 1-18 or a polypeptide of SEQ ID No. 

17, and 

a targeting agent linked to the polypeptide directly or via a linker. 
10 50. The conjugate of claim 49, wherein the targeting agent permits 

affinity isolation or purification of the conjugate; 
attachment of the conjugate to a surface; 
detection of the conjugate; or 
targeted delivery to a selected tissue or cell. 
15 51. A combination, comprising: 

an agent or treatment that inhibits a catalytic activity of a 
polypeptide of any of claims 1-18 or a polypeptide of SEQ ID No. 17; and 

another treatment or agent selected from anti-tumor and 
anti-angiogenic treatments and agents. 
20 52. The combination of claim 51 , wherein the inhibitor and the anti- 

tumor and/or anti-angiogenic agent are formulated in a single pharmaceutical 
composition or each is formulated in separate pharmaceutical compositions. 

53. The combination of claim 52, wherein the inhibitor is selected from 
antibodies and antisense oligonucleotides and double-stranded RNA (dsRNA). 
25 54. A solid support comprising two or more polypeptides of claim 1 

and/ or a polypeptide of SEQ ID No. 17 linked thereto either directly or via a 
linker. 

55. The support of claim 54, wherein the polypeptides comprise an 

array. 

30 56. The support of claim 55, wherein the polypeptides comprise a 

plurality of protease domains. 
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57. A solid support comprising two or more nucleic acid molecules of 
claim 20 or oligonucleotide portions thereof linked thereto either directly or via a 
linker, wherein the oligonucleotides contain at least 16 nucleotides. 

58. The support of claim 57, wherein the nucleic acid molecules 
5 comprise an array. 

59. The support of claim 57 or claim 58, wherein the nucleic acid 
molecules comprise a plurality of molecules that encode protease domains. 

60. A method for identifying or screening compounds that modulate a 
protease activity of an MTSP20 polypeptide, comprising: 

10 contacting a polypeptide of any of claims 1-18 or a polypeptide of SEQ ID 

No. 17 or a proteolyttcally active portion of the polypeptide with a substrate that 
is proteolytically cleaved by the polypeptide, and, either simultaneously, before 
or after, adding a test compound or plurality thereof; 

measuring the amount of substrate cleaved in the presence of the test 
15 compound; and 

selecting compounds that change the amount of substrate cleaved 
compared to a control, whereby compounds that modulate an activity of the 
polypeptide are identified. 

61. The method of claim 60, wherein the test compounds are small 
20 molecules, peptides, peptldomlmetics, nucleic acid molecules, natural products, 

antibodies or fragments thereof that modulate an activity of the polypeptide. 

62. The method of claim 61 or claim 62, wherein a plurality of the test 
substances are screened simultaneously. 

63. The method of claim 61 or claim 62, wherein the polypeptide is 
25 encoded by a sequence of nucleotides selected from the group consisting of: 

(a) a sequence of nucleotides in SEQ ID No. 5 or SEQ ID No. 1 5 or a 
sequence of nucleotides that encodes the polyeptide of SEQ ID No. 17; 

(b) a sequence of nucleotides that hybridizes under high stringency along 
its length or along at least about 70% of the full-length to the sequence of 

30 nucleotides in SEQ ID No. 5 or SEQ ID No. 1 5 or a sequence of nucleotides that 
encodes the polyeptide of SEQ ID No. 17; 
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{c) a sequence of nucleotides that encodes the polypeptide of SEQ ID No. 
6 or SEQ ID No. 16 or SEQ ID No. 17; 

(d) a sequence of nucleotides that is a splice variant of (a), (b) or (c); 
(el a sequence of nucleotides that encodes a protease domain or a 
5 catalytically active portion thereof that includes a sequence of 

nucleotides having at least about 60%, 70%, 80%, 90% or 95% 
sequence identity with the sequence set forth in SEQ ID No. 5 or 
1 5; and 

(f) a sequence of nucleotides comprising degenerate codons of (a), 
10 (b), (c), (d) or (e). 

64. The method of any of claims 60-63, wherein the polypeptide is 
selected from the group consisting of: 

a polypeptide that consists essentially of a sequence of amino 
acids set forth in SEQ ID No. 6 or as amino acids 1 13-343 of SEQ ID No. 16 or 
15 as amino acids 376-642 of SEQ ID No. 16; 

a polypeptide that consists essentially of a sequence of amino 
acids encoded by a sequence of nucleotides that hybridizes under conditions of 
high stringency to the sequence of nucleotides in SEQ ID No. 5 or set forth as 
nucleotides 360-1052 or 1148-1952 in SEQ ID No. 15; 
20 a polypeptide that comprises the sequence of amino acids set 

forth as residues 376-642 SEQ ID No. 16; 

a polypeptide that comprises a sequence of amino acids having at 
least about 60% sequence identity with the sequence of amino acids in SEQ ID 
No. 6 or the sequence of amino acids in SEQ ID No. 16; and 
25 a polypeptide that is encoded by a sequence of nucleotides that is 

a splice variant of the sequence set forth in SEQ ID No. 15. 

65. The method of any of claims 60-64, wherein the change In the 
amount of substrate cleaved is assessed by comparing the amount of substrate 
cleaved in the presence of the test compound with the amount of substrate 

30 cleaved in the absence of the test compound. 

66. The method of any of claims 60-65, wherein a plurality of the 
polypeptides are linked to a solid support, either directly or via a linker. 
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67. The method of claim 66, wherein the polypeptides comprise an 

array. 

68. A method of identifying a compound that specifically binds to a 
single-chain and/or multi-chain MTSP20 protease domain and/or to a single or 

5 multi-chain MTSP20 polypeptide and/or to a proteolytically active portion of a 
single or a multi-chain form of an MTSP20 polypeptide, comprising: 

contacting an MTSP20 polypeptide of any of claims 1-18 or a 
polypeptide of SEQ ID No. 17 or a proteolytically active portion of the 
polypeptide with a test compound or plurality thereof under conditions 
10 conducive to binding thereof; and either: 

a) identifying test compounds that specifically bind to the single 
chain and/or multi chain form of the polypeptide or to a proteolytically active 
portion of the single and/or multi chain form thereof, or 

b) identifying test compounds that inhibit binding of a compound 
15 known to bind a single chain and/or multi chain form of the polypeptide or to a 

proteolytically active portion of the single and/or multi- chain form thereof, 
wherein the known compound is contacted with the polypeptide before, simul- 
taneously with or after the test compound, wherein the multi-chain form is a 
two, three or four chain polypeptide. 
20 69. The method of claim 68, wherein the polypeptide is linked either 

directly or indirectly via a linker to a solid support. 

70. The method of claim 68 or claim 69, wherein the test compounds 
are small molecules, peptides, peptidomimetics, natural products, antibodies or 
fragments thereof. 

25 71 . The method of any of claims 68-70, wherein a plurality of the test 

substances are screened simultaneously. 

72. The method of claim 71, wherein the polypeptides comprise an 

array. 

73. The method of any of claims 68-72, wherein the polypeptide is 
30 encoded by a sequence of nucleotides selected from the group consisting of: 

(a) a sequence of nucleotides In SEQ ID No. 5 or SEQ ID No. 15 or a 
sequence of nucleotides that encodes the polyeptide of SEQ ID No. 17; 
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(b) a sequence of nucleotides that hybridizes under high stringency along 
its length or along at least about 70% of the full-length to the sequence of 
nucleotides In SEQ ID No. 5 or SEQ ID No. 15 or a sequence of nucleotides that 
encodes the polyeptide of SEQ ID No. 17; 
5 (c) a sequence of nucleotides that encodes the polypeptide of SEQ ID No. 

6 or SEQ ID No,16 or SEQ ID No. 17; 

(d) a sequence of nucleotides that is a splice variant of (a), (b) or (c); 

(e) a sequence of nucleotides that encodes a protease domain or a 
catalytically active portion thereof that includes a sequence of 

10 nucleotides having at least about 60%, 70%, 80%, 90% or 95% 

sequence identity with the sequence set forth in SEQ ID No. 5 or 
1 5; and 

(f) a sequence of nucleotides comprising degenerate codons of (a), 
(b), (c), (d) or (e). 

1 5 74. The method of any of claims 68-73, wherein the polypeptide is 

selected from the group consisting of: 

a polypeptide that consists essentially of a sequence of amino 
acids set forth in SEQ ID No. 6 or as amino acids 113-343 of SEQ ID No. 16 or 
as amino acids 376-642 of SEQ ID No. 16. 
20 a polypeptide that consists essentially of a sequence of amino 

acids encoded by a sequence of nucleotides that hybridizes under conditions of 
high stringency to the sequence of nucleotides in SEQ ID No. 5 or set forth as 
nucleotides 360-1052 or 1 148-1952 in SEQ ID No. 15; 

a polypeptide that comprises the sequence of amino acids set 
25 forth as residues 376-642 SEQ ID No. 1 6; 

a polypeptide that comprises a sequence of amino acids having at 
least about 60% sequence identity with the sequence of amino acids in SEQ ID 
No. 6 or the sequence of amino acids in SEQ ID No. 16; and 

a polypeptide that is encoded by a sequence of nucleotides that is 
30 a splice variant of the sequence set forth in SEQ ID No. 15. 

75. A method for identifying activators of a zymogen or inactive form 
of an MTSP20, comprising: 
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contacting a zymogen form or a proteolytically inactive form of an 
. MTSP20 polypeptide of any of claims 1-18 or a polypeptide of SEQ ID No. 17 or 
a proteolytically active portion of the polypeptide with a substrate of the 
activated form of the polypeptide; 
5 adding a test compound, wherein the test compound is added 

before, after or simultaneously with the addition of the substrate; and 
detecting cleavage of the substrate, thereby identifying 
compounds that activate the zymogen. 

76. The method of claim 75, wherein the substrate is a chromogenic 
10 substrate. 

77. The method of claim 75, wherein the substrate is a 
L-pyroglutamyl-L-prolyl-L-arginine-p-nitroaniline hydrochloride. 

78. The method of any of claims 75-77, wherein the test compound is 
a small molecule, a nucleic acid or a polypeptide. 

15 79. A method for treating or preventing a neoplastic disease, in a 

mammal, comprising administering to a mammal an effective amount of an 
inhibitor of a polypeptide of any of claims V18 or a polypeptide of SEQ ID No. 
17. 

80. The method of claim 79, wherein the inhibitor is an antibody that 
20 specifically binds to the polypeptide, or a fragment or derivative of the antibody 

containing a binding domain thereof, wherein the antibody is a polyclonal 
antibody or a monoclonal antibody. 

81 . The method of claim 79 or claim 80, wherein the polypeptide is 
encoded by a sequence of nucleotides selected from the group consisting of: 

25 {a) a sequence of nucleotides in SEQ ID No. 5 or SEQ ID No. 15 or a 

sequence of nucleotides that encodes the polyeptide of SEQ ID No. 17; 

(b) a sequence of nucleotides that hybridizes under high stringency along 
Its length or along at least about 70% of the full-length to the sequence of 
nucleotides in SEQ ID No. 5 or SEQ ID No. 15 or a sequence of nucleotides that 

30 encodes the polyeptide of SEQ ID No. 17; 

(c) a sequence of nucleotides that encodes the polypeptide of SEQ ID No. 
6 or SEQ ID No. 16 or SEQ ID No. 17; 
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(d) a sequence of nucleotides that is a splice variant of (a), (b) or (c); 

(e) a sequence of nucleotides that encodes a protease domain or a 
catalytically active portion thereof that Includes a sequence of 
nucleotides having at least about 60%, 70%, 80%, 90% or 95% 

5 sequence identity with the sequence set forth in SEQ ID No. 5 or 

15; and 

(f) a sequence of nucleotides comprising degenerate codons of (a), 
(b), (c). (d) or (e). 

82. The method of any of claims 79-81, wherein the polypeptide is 
10 selected from the group consisting of: 

a polypeptide that consists essentially of a sequence of amino 
acids set forth in SEQ ID No. 6 or as amino acids 1 1 3-343 of SEQ ID No. 1 6 or 
as amino acids 376-642 of SEQ ID No. 16. 

a polypeptide that consists essentially of a sequence of amino 
15 acids encoded by a sequence of nucleotides that hybridizes under conditions of 
high stringency to the sequence of nucleotides in SEQ ID No. 5 or set forth as 
nucleotides 360-1052 or 1 148-1952 in SEQ ID No. 1 5; 

a polypeptide that comprises the sequence of amino acids set 
forth as residues 376-642 SEQ ID No. 16; 
20 a polypeptide that comprises a sequence of amino acids having at 

least about 60% sequence identity with the sequence of amino acids in SEQ ID 
No. 6 or the sequence of amino acids in SEQ ID No. 16; and 

a polypeptide that is encoded by a sequence of nucleotides that is 
a splice variant of the sequence set forth in SEQ ID No. 1 5. 
25 83. A method of inhibiting tumor Initiation, growth or progression or 

treating a malignant or pre-malignant condition, comprising administering an 
agent that inhibits activation cleavage of a protease domain of the zymogen form 
of an MTSP20 polypeptide of any of claims 1-18 or a polypeptide of SEQ ID No. 
17 or a proteolytically active portion of the polypeptide , upon cleavage, is 
30 proteolytically active, or Inhibits an activity of the activated form of MTSP20 or a 
portion thereof that is proteolytically active. 
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84. The method of claim 83, wherein the condition is a condition of 
the breast, cervix, prostate, lung, ovary or colon. 

85. The method of claim 83 or claim 84, wherein the agent is an 
antisense oligonucleotide, double-stranded RNA (dsRNA) or an antibody- 

5 86. The method of any of claims 83-85, further comprising 

administering another treatment or agent selected from anti-tumor and anti- 
angiogenic treatments or agents. 

87. The method of any of claims 83-86, wherein the polypeptide is 
encoded by a sequence of nucleotides selected from the group consisting of: 

10 (a) a sequence of nucleotides in SEQ ID No. 5 or SEQ ID No. 15 or a 

sequence of nucleotides that encodes the polyeptrde of SEQ ID No. 1 7; 

(b) a sequence of nucleotides that hybridizes under high stringency along 
its length or along at least about 70% of the full-length to the sequence of 
nucleotides in SEQ ID No. 5 or SEQ ID No. 15 or a sequence of nucleotides that 

15 encodes the polyeptide of SEQ ID No. 17; 

(c) a sequence of nucleotides that encodes the polypeptide of SEQ ID No. 
6 or SEQ ID No. 1 6 or SEQ ID No. 1 7; 

(d) a sequence of nucleotides that is a splice variant of (a), (b) or (c); 

(e) a sequence of nucleotides that encodes a protease domain or a 
20 catalytically active portion thereof that includes a sequence of 

nucleotides having at least about 60%, 70%, 80%, 90% or 95% 
sequence identity with the sequence set forth in SEQ ID No. 5 or 
15; and 

(f) a sequence of nucleotides comprising degenerate codons of (a), 
25 (b). (c), (d) or (e). 

88. The method of any of claims 83-86, wherein the polypeptide, 
wherein the polypeptide, wherein the polypeptide is selected from the group 
consisting of: 

a polypeptide that consists essentially of a sequence of amino 
30 acids set forth in SEQ ID No. 6 or as amino acids 1 13-343 of SEQ ID No. 16 or 
as amino acids 376-642 of SEQ ID No. 16. 
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a potypeptide that consists essentially of a sequence of amino 
acids encoded by a sequence of nucleotides that hybridizes under conditions of 
high stringency to the sequence of nucleotides in SEQ ID No. 5 or set forth as 
nucleotides 360-1052 or 1148-1952 in SEQ ID No. 15; 
5 a polypeptide that comprises the sequence of amino acids set 

forth as residues 376-642 SEQ ID No. 16; 

a polypeptide that comprises a sequence of amino acids having at 
least about 60% sequence identity with the sequence of amino acids in SEQ ID 
No. 6 or the sequence of amino acids in SEQ ID No. 16; and 
10 a polypeptide that is encoded by a sequence of nucleotides that is 

a splice variant of the sequence set forth in SEQ ID No. 15. 

89. A method of identifying a compound that binds to the single-chain 
and/or a multi-chain form of an MTSP20 polypeptide and/or to a proteolyticatly 
active portion of a single-chain and/or multi-chain form of an MTSP20 

15 polypeptide, comprising: 

contacting a test compound with both a single-chain and multi-chain form 
of the MTSP20 polypeptide and/or proteolytically active portion thereof; 
determining to which form<s) the compound binds; and 
if it binds to a form of polypeptide, further determining whether the 

20 compound has at least one of the following properties: 

(i) Inhibits activation cleavage of a zymogen form of a protease 
domain of the polypeptide; 

(ii) inhibits activity of a protease domain of a multi-chain or single- 

s- 
chain form; and 

25 (iii) inhibits dimerization of the polypeptide, wherein the multi-chain 

form is a two, three or four chain polypepeptide. 

90. A method of detecting neoplastic disease, comprising: detecting a 
polypeptide that comprises a polypeptide of any of claims 1-18 or the 
polypeptide of SEQ ID No. 17 in or from a sample, wherein the amount, the form 

30 and/or activity detected differs from the amount, the form and/or activity of 
polypeptide detected from a subject who does not have neoplastic disease. 
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91 . The method of claim 90. wherein the sample is selected from the 
group consisting of blood, urine, saliva, tears, synovial fluid, sweat, interstitial 
fluid, sperm, cerebrospinal fluid, ascites fluid, tumor tissue biopsy and circulating 
tumor cells. 

5 92. The method of claim 90 or claim 91, wherein polypeptide consists 

essentially of a protease domain or a plurality thereof. 

93. A method of diagnosing the presence of a pre-malignant lesion, a 
malignancy, or other pathologic condition in a subject, comprising: 
obtaining a biological sample from the subject; and 
10 exposing the sample or polypeptides isolated therefrom to a detectable 

agent that binds to a multi-chain and/or single-chain form of an MTSP20 
polypeptide, wherein the pathological condition is characterized by the presence 
or absence of the multi-chain or single-chain form, wherein the multi-chain form 
is a two, three or four chain polypepeptide. 
15 94. The method of claim 93, wherein the biological sample is selected 

from the group consisting of blood, urine, saliva, tears, synovial fluid, sweat, 
interstitial fluid, cerebrospinal fluid, a sperm sample, ascites fluid, tumor tissue 
biopsy and circulating tumor cells. 

95. A method of monitoring tumor progression and/or therapeutic 

20 effectiveness, comprising detecting and/or quantifying the level, the form and/or 
activity of an MTSP20 polypeptide in a body tissue or fluid sample. 

96. The method of claim 95, wherein the tumor is a tumor of the 
breast, cervix, prostate, lung, ovary or colon. 

97. The method of claim 95 or 96, wherein the body fluid is blood, 
25 urine, sweat, saliva, cerebrospinal fluid and synovial fluid. 

98. A transgenic non-human animal, comprising heterologous nucleic 
acid encoding a polypeptide of claim 1 or the polypeptide of SEQ ID No. 17. 

99. A probe or primer that: 

comprises at least 14 contiguous nucleotides or modified nucleotides that 
30 are identical to a contiguous sequence of nucleotides encoding the amino acids 
624-642 of SEQ ID No. 16; 
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comprises at least 16 contiguous nucleotides or modified nucleotides that 
are identical to a contiguous sequence of nucleotides encoding the amino acids 
624-642 of SEQ ID No. 1 6; or 

comprises at least 30 contiguous nucleotides or modified nucleotides that 
5 are identical to a contiguous sequence of nucleotides encoding the amino acids 
624-642 of SEQ ID No. 16. 

100. A method for treating or preventing a disease or disorder 
associated with undesired and/or uncontrolled anglogenesis or 
neovascularization, in a mammal, comprising administering to a mammal an 

10 effective amount of an inhibitor of an MTSP20 polypeptide. 

101. The method of claim 100, wherein the Inhibitor is selected from 
the group consisting of an antibody or a fragment or derivative thereof that 
specifically binds to the MTSP20, an antisense nucleic acid, a double-stranded 
RNA molecule. 

15 102. The method of claim 100, wherein the undesired angiogenesis is 

associated with disorders selected from the group consisting of solid neoplasms, 
vascular malformations and cardiovascular disorders, chronic inflammatory 
diseases and aberrant wound repairs, circulatory disorders, crest syndromes, 
dermatological disorders and ocular disorders. 

20 103. The method of claim 102, wherein: 

the vascular malformations and cardiovascular disorders are selected from 
the group consisting of angiofibroma, angiolipoma, atherosclerosis, 
restenosis/reperfusion injury, arteriovenous malformations, hemangiomatosis and 
vascular adhesions, dyschondroplasia with vascular hamartomas (Fafucci's 

25 syndrome), hereditary hemorrhagic telangiectasia {Rendu-Osler-Weber syndrome) 
and Von Hippie Lindau syndrome; 

the chronic inflammatory diseases are selected from the group consisting 
of diabetes mellitus, hemophiliac joints, inflammatory bowel disease, nonhealing 
fractures, periodontitis (rapidly progressing and juvenile), psoriasis, rheumatoid 

30 arthritis, venous stasis ulcers, granulations-burns, hypertrophic scars, liver 

cirrhosis, osteoradionecrosis, postoperative adhesions, pyogenic granuloma and 
systemic sclerosis; 
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the circulatory disorder is Raynaud's phenomenon; 

the crest syndromes are selected from the group consisting of calcinosis, 
esophageal, dyomotiloty, sclerodactyly and teangiectasis; 

the dermatological disorders are selected from the group consisting of 
5 systemic vasculitis, scleroderma, pyoderma gangrenosum, vasculopathy, 
venous, arterial ulcers, Sturge-Weber syndrome. Port-wine stains, blue rubber 
bleb nevus syndrome, Ktippel-Trenaunay-Weber syndrome and Osler-Weber- 
Rendu syndrome; 

the ocular disorders are selected from the group consisting of blindness 
10 caused by ocular neovascular disease, corneal graft neovascularization, macular 
degeneration in the eye, neovascular glaucoma, trachoma, diabetic retinopathy, 
myopic degeratlon. retinopathy of prematurity, retrolental fibroplasia and corneal 
neovascularization. 

104. A combination, comprising: 

15 a} an inhibitor of the proteolytic activity of the MTSP20; and 

b) an another treatment or agent selected from anti-tumor and anti- 
anglogenic treatments or agents. 

105. The combination of claim 104, wherein the inhibitor and the anti- 
angiogenic agent are formulated in a single pharmaceutical composition or each 

20 is formulated in separate pharmaceutical compositions. 

106. The combination of 104, wherein the inhibitor is selected from 
antibodies, antisense oligonucleotides and double-stranded RNA molecules. 

107. A computational method for screening, comprising: 
assessing the interaction of a test compound with a computer- 

25 simulated polypeptide that has the sequence of amino acids of a polypeptide of 
any of claims 1-18 or an MTSP20 polypeptide that comprises the sequence of 
amino acid set forth in SEQ ID No, 1 7; and 

identifying test compounds that interact with the polypeptide, 
wherein assessment is effected in silico. 
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108. The polypeptide of claim 1, wherein: 

the l\/ITSP20 portion of the polypeptide consists essentially of 
protease domain 2 (PD2) of the MTSP20 or a catalyticalty active portion 
thereof. 
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SEQUENCE LISTING 



<110> Corvae International, Inc. 
Madison, Edwin 
Ong , Edgar 

<X20> NUCLEIC ACID MOLECULES ENCODING 

TRANSMEMBRANE SERINE PROTEASE 20, THE ENCODED POLYPEPTIDES AND 
METHODS BASED THEREON 

<130> 24745-1618PC 

<140> Not Yet Assigned 
<141> Herewith 

<160> 14 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 3147 

<212 5. DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (23) . . . (2589) 

<:223> Nucleotide sequence encoding MTSPl 
<300> 

<301> O'Brien, T.J. and Tanimoto, H, 

<308> GenBank #AR081724 

<309> 2000-08-31 

<310> 5,972,616 

<311> 1998-02-20 

<312> 1999-10-26 

<400> 1 

tcaagagcgg cctcggggta cc atg ggg age gat egg gcc cgc aag ggc gga 52 

Met Gly Ser Asp Arg Ala Arg Lys Gly Gly 
IS 10 

ggg ggc ccg aag gac ttc ggc gcg gga etc aag tac aac tec egg cac 100 
Gly Gly Pro Lys Asp Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His 

15 20 25 

gag aaa gtg aat ggc ttg gag gaa ggc gtg gag ttc ctg cca gtc aac 14 8 

Glu Lys Val Asn Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn 
30 35 40 

aac gtc aag aag gtg gaa aag cat ggc ccg ggg cgc tgg gtg gtg ctg 196 
Asn Val Lys Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu 
45 50 55 



gca gcc gtg ctg ate ggc etc etc ttg gtc ttg ctg ggg ate ggc ttc 
Ala Ala Val Leu lie Gly Leu Leu Leu Val Leu Leu Gly He Gly Phe 
60 65 70 



244 
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ctg gtg tgg cat ttg cag tac egg gac gtg cgt gtc cag aag gtc ttc 292 
Leu Val Trp His Leu Gin Tyx Arg Asp Val Arg Val Gin Lys Val Phe 
75 80 85 90 



aat ggc tac atg agg ate aca aat gag aat ttt gtg gat gcc tac gag 
Asn Gly Tyr Met Arg He Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu 

95 100 105 



gcg ctg aag ctg ctg tac age gga gtc cca ttc ctg ggc ccc tac cac 
Ala Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly Pro Tyr His 
125 130 135 

aag gag teg get gtg acg gcc ttc age gag ggc age gtc ate gcc tac 
Lya Glu Ser Ala Val Thr Ala Phe Ser Glu Gly Ser Val He Ala Tyr 
140 145 ISO 



gac gcc gac tea gtg ctg age etc acc ttc cgc age ttt gac ctt gcg 
Asp Ala Asp Ser Val Leu Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala 

255 260 265 

tec tge gac gag cgc ggc age gac ctg gtg acg gtg tac aac acc ctg 
Ser Cys Asp Glu Arg Gly Ser Asp Leu Val Thr Val Tyr Asn Thr Leu 
270 275 280 

age .ccc atg gag ccc cac gcc ctg gtg cag ttg tgc ggc acc tac cct 
Ser Pro Met Glu Pro His Ala Leu Val Gin Leu Cys Gly Thr Tyr Pro 
285 290 295 

ccc tec tac aac ctg acc ttc cac tec tec cag aac gtc ctg etc ate 
Pro Ser Tyr Asn Leu Thr Phe His Ser Ser Gin Asn Val Leu Leu He 
300 305 310 



340 



aac tec aac tec act gag ttt gta age ctg gcc age aag gtg aag gac 388 
Asn Ser Asn Ser Thr Glu Phe Val Ser Leu Ala Ser Lys Val Lys Asp 
110 115 120 



436 



484 



628 



tac tgg tct gag ttc age ate ccg cag cac ctg gtg gag gag gee gag 532 
Tyr Trp Ser Glu Phe Ser He Pro Gin His Leu Val Glu Glu Ala Glu 
155 160 165 170 

cgc gtc atg gcc gag gag cgc gta gtc atg ctg ccc ccg egg gcg cgc 580 
Arg Val Met Ala Glu Glu Arg Val Val Met Leu Pro Pro Arg Ala Arg 

175 180 185 

tec ctg aag tec ttt gtg gtc acc tea gtg gtg get ttc ccc acg gac 
Ser Leu Lys Ser Phe Val Val Thr Ser Val Val Ala Phe Pro Thr Asp 
190 195 200 

tec aaa aca gta cag agg acc cag gac aac age tgc age ttt ggc ctg 676 
Ser Lys Thr Val Gin Arg Thr Gin Asp Asn Ser Cys Ser Phe Gly Leu 
205 210 215 

cac gee cgc ggt gtg gag ctg atg cgc ttc acc acg ccc ggc ttc cct 724 
His Ala Arg Gly Val Glu Leu Met Arg Phe Thr Thr Pro Gly Phe Pro 
220 225 230 

gac age ccc tac ccc get cat gcc cgc tgc cag tgg gcc ctg egg ggg 772 
Asp Ser Pro Tyr Pro Ala His Ala Arg Cye Gin Trp Ala Leu Arg Gly 
235 240 245 250 



820 



868 



916 



964 
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aca ctg ata acc aac act gag egg egg cat ccc ggc ttt gag gcc acc 1012 
Thr Leu lie Thr Asn Thr Glu Arg Arg His Pro Gly Phe Glu Ala Thr 
315 320 325 330 

ttc ttc cag ctg cct agg atg age age tgt gga ggc cgc tta cgt aaa 1060 
Phe Phe Gin Leu Pro Arg Met Ser Ser Cys Gly Gly Arg Leu Arg Lys 

335 340 345 

gcc cag ggg aca ttc aac age ccc tae tac cca ggc cac tac cca ccc llOB 
Ala Gin Gly Thr Phe Asn Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro 
350 355 360 

aac att gac tgc aca tgg aac att gag gtg ccc aac aac cag cat gtg 1156 
Asn lie Asp Cys Thr Trp Asn lie Glu Val Pro Asn Asn Gin His Val 
365 3.70 375 

aag gtg age ttc aaa ttc ttc tac ctg ctg gag ccc ggc gtg cct gcg 1204 
Lys Val Ser Phe Lys Phe Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala 
380 385 390 

ggc acc tgc ccc aag gac tac gtg gag ate aat ggg gag aaa tac tgc 1252 
Gly Thr Cys Pro Lys Asp Tyr Val Glu lie Asn Gly Glu Lys Tyr Cys 
395 400 405 410 

gga gag agg tec cag ttc gtc gtc acc age aac age aac aag ate aca 1300 
Gly Glu Arg Ser Gin Phe Val Val Thr Ser Asn Ser Asn Lys lie Thr 

415 420 425 

gtt cgc ttc cac tea gat cag tec tac acc gac acc ggc ttc tta get 1348 
Val Arg Phe His Ser Asp Gin Ser Tyr Thr Asp Thr Gly Phe Leu Ala 
430 435 440 

gaa tac etc tec tac gac tec agt gac cca tgc ccg ggg cag ttc acg 13 96 
Glu Tyr Leu Ser Tyr Asp Ser Ser Asp Pro Cys Pro Gly Gin Phe Thr 
445 450 455 

tgc cgc acg ggg egg tgt ate egg aag gag ctg cgc tgt gat ggc tgg 1444 
Cys Arg Thr Gly Arg Cys lie Arg Lys Glu Leu Arg Cys Asp Gly Trp 
460 465 470 

gcc gac tgc acc gac cac age gat gag etc aac tgc agt tgc gac gcc 1492 
Ala Asp Cys Thr Asp His Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala 
475 480 485 490 

ggc cac cag ttc acg tgc aag aac aag ttc tgc aag ccc etc ttc tgg 1540 
Gly His Gin Phe Thr Cys Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp 

495 500 505 

gtc tgc gac agt gtg aac gac tgc gga gac aac age gac gag cag ggg 1588 
Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn Ser Asp Glu Gin Gly 
510 515 520 

tgc agt tgt ccg gcc cag acc ttc agg tgt tec aat ggg aag tgc etc 163 6 

Cys Ser Cys Pro Ala Gin Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu 
525 530 535 

teg aaa age cag cag tgc aat ggg aag gac gac tgt ggg gac ggg tec 1684 
Ser Lys Ser Gin Gin Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser 
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540 545 550 

gac gag gcc tec tgc ccc aag gtg aac gtc gtc act tgt acc aaa cac 1732 
Asp Glu Ala Ser Cys Pro Lys Val Asn Val Val Thr Cys Thr Lya His 
555 5S0 565 570 

acc tac cgc tgc etc aat ggg etc tgc ttg age aag gge aac cct gag 1780 
Thr Tyr Arg Cys Leu Asn Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu 

575 580 585 

tgt gac ggg aag gag gac tgt age gac ggc tea gat gag aag gac tgc 182 8 

Cys Asp Gly Lys Glu Asp Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys 
590 595 600 

gac tgt ggg ctg egg tea ttc acg aga cag get cgt gtt gtt ggg ggc 1876 
Asp Cys Gly Leu Arg Ser Phe Thr Arg Gin Ala Arg Val Val Gly Gly 
605 610 615 

acg gat gcg gat gag ggc gag tgg ccc tgg cag gta age ctg cat get 1924 
Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin Val Ser Leu His Ala 
620 625 630 

ctg ggc cag ggc cac ate tgc ggt get tec etc ate tct ccc aac tgg 1972 
Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu lie Ser Pro Asn Trp 
635 640 645 650 

ctg gtc tct gcc gea cac tgc tac ate gat gac aga gga ttc agg tac 2020 
Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp Arg Gly Phe Arg Tyr 

655 660 665 

tea gac ccc acg cag tgg acg gcc ttc ctg ggc ttg cac gac cag age 2068 
Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu His Asp Gin Ser 
€70 675 680 

cag cgc age gee cct ggg gtg cag gag cgc agg etc aag cgc ate ate 2116 
Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu Lys Arg lie lie 
685 690 695 

tec cac ccc ttc ttc aat gac ttc acc ttc gac tat gac ate gcg ctg 2164 
Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr Asp lie Ala Leu 
700 705 710 

ctg gag ctg gag aaa ccg gca gag tac age tec atg gtg egg ccc ate 2212 
Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro lie 
715 720 725 730 

tgc ctg ccg gac gcc tec cat gtc ttc cct gee ggc aag gcc ate tgg 226 0 

Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala lie Trp 

735 740 745 

gtc acg ggc tgg gga cac acc cag tat gga ggc act gge gcg ctg ate 23 08 

Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr Gly Ala Leu He 
750 755 760 

ctg caa aag ggt gag ate cgc gtc ate aac cag acc acc tgc gag aac 23 56 

Leu Gin Lys Gly Glu He Arg Val He Asn Gin Thr Thr Cys Glu Asn 
765 770 775 

etc ctg ccg cag cag ate acg ccg cgc atg atg tgc gtg ggc ttc etc 24 04 
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Leu Leu Pro Gin Gin He Thr Pro Arg Met Met Cys Val Gly Phe Leu 
780 785 790 



age ggc ggc gtg gac tec tgc cag ggt gat tec ggg gga ccc ctg tec 
Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Ser 
795 800 80S 810 



2452 



age gtg gag gcg gat ggg egg ate ttc cag gee ggt gtg gtg age tgg 
Ser Val Glu Ala Asp Gly Arg He Phe Gin Ala Gly Val Val Ser Trp 

815 B20 825 



2500 



gga gac ggc tgc get cag agg aae aag cca ggc gtg tae aea agg etc 
Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val Tyr Thr Arg Leu 
830 835 840 



2548 



cct ctg ttt egg gac tgg ate aaa gag aac act ggg gta ta ggggecgggg 
Pro Leu Phe Arg Asp Trp He Lys Glu Asn Thr Gly Val 
845 850 855 



2599 



ccacccaaat 
ggctggagac 
ctceagggct 
gggaggtaga 
agacacagcc 
cccctgtctg 
ggctgccgga 
aggaccctgg 
ctteagtgtg 
aaaaaaaa 



gtgtacacct 
tggaccgctg 
ccaaatctge 
aggggaggac 
tcccecgcca 
taaggagcag 
tetgggctgt 
aaaacagaeg 
tgtatttgtg 



gcggggccac 
actgcaccag 
ctagaaaacc 
actggtggtt 
gccecaagct 
cgggaacgga 
ggggcccttg 
ggtctgagae 
taaatgggta 



ccategteca 
cgcccccaga 
tctcgcttcc 
ctactgaecc 
gggccgaggc 
gcttcggagc 
ggccacgetc 
tgaaattgtt 
aaacaattta 



ccccagtgtg 
acatacactg 
tcageeteea 
aactggggge 
gcgtttgtgt 
ctcctcagtg 
ttgaggaagc 
ttaccagctc 
tttcttttta 



cacgectgca 2659 

tgaaetcaat 2719 

aagtggagct 2779 

aaaggtttga 283 9 

atatetgcct 2899 

aaggtggtgg 2959 

ccaggctcgg 3019 

ccagggtgga 3079 

aaaaaaaaaa 3139 
3147 



<210> 2 

<211> 855 

<212> PRT 

<:213> Homo Sapien 



<400> 2 






Met 


Gly 


Ser 


Asp 


1 








Gly Ala 


Gly 


Leu 








20 


Glu 


Glu 


Gly 


Val 






35 




Lys 


His 


Gly 


Pro 




50 






Leu 


Leu 


Leu 


Val 


65 








Tyr Arg 


Asp 


Val 


Thr 


Asn 


Glu 


Asn 








100 


Phe 


Val 


Ser 


Leu 






115 




Ser 


Gly 


Val 


Pro 




130 






Ala 


Phe 


Ser 


Glu 


145 








He 


Pro 


Gin 


His 


Arg 


Val 


Val 


Met 



Arg 


Ala 


Arg 


Lys 


5 








Lys 


Tyr 


Asn 


Ser 


Glu 


Phe 


Leu 


Pro 








40 


Gly 


Arg 


Trp 


Val 






55 




Leu 


Leu 


Gly 


He 




70 






Arg 


Val 


Gin 


Lys 


85 








Phe 


Val 


Asp 


Ala 


Ala 


Ser 


Lys 


Val 








120 


Phe 


Leu 


Gly 


Pro 






135 




Gly 


Ser 


Val 


He 




150 






Leu 


Val 


Glu 


Glu 


165 








Leu 


Pro 


Pro 


Arg 



Gly 


Gly 


Gly 


Gly 




10 






Arg 


His 


Glu 


Lys 


25 








Val 


Asn 


Asn 


Val 


Val 


Leu 


Ala 


Ala 








60 


Gly 


Phe 


Leu 


Val 






75 




Val 


Phe 


Asn 


Gly 




90 






Tyr 


Glu 


Asn 


Ser 


105 








Lys 


Asp 


Ala 


Leu 


Tyr 


His 


Lys 


Glu 








140 


Ala 


Tyr 


Tyr 


Trp 






155 




Ala 


Glu 


Arg 


val 




170 






Ala 


Arg 


Ser 


Leu 



Pro 


Lys 


Asp 


Phe 






15 




Val 


Asn 


Gly 


Leu 




30 






Lys 


Lys 


Val 


Glu 


45 








Val 


Leu 


He 


Gly 


Trp 


His 


Leu 


Gin 








80 


Tyr 


Met 


Arg 


He 






95 




Asn 


Ser 


Thr 


Glu 




110 






Lys 


Leu 


Leu 


Tyr 


125 








Ser 


Ala 


Val 


Thr 


Ser 


Glu 


Phe 


Ser 








160 


Met 


Ala 


Glu 


Glu 






175 




Lys 


Ser 


Phe 


Val 
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T HO 










Val 
V 




Ser 


Val 


val 


Ala 


Phe 


Pro 






^ J ^ 










200 


X. nx^ 


nl n 




fid ^4 


Ser 


Cys 


Ser 


Phe 




^ u 








215 






Met 




Phe 


Thr 


Thr 


Pro 


Gly 


^ ^ D 








230 






His 






Cvs 


Gin 


Trp 


Ala 


Leu 










245 








sex 


T .A1 1 

Jjcu 






Arg 


Ser 


Phe 


Asp 








^ o u 










sex 


Asp 


JjcU 


VAX 


Thr 


Val 


Tyr 


Asn 














2B0 






V ax 


flln 


Leu 


Cys 


Gly 


Thr 




290 










295 






Li-i e, 


OCX. 




Gin 


Asn 


Val 


Leu 


O V O 










310 






uJLU 


MX 2 


Axg 


Hi R 


Pro Gly 


Phe 


Glu 








325 








nec 


O A -V* 

oex 


ocX 


s 


Gly Gly 


Arg 


Leu 








34 0 

^ 1/ 










A 
OCX 


Pro 


lyr 


lyr 


Pro 


Gly 


His 


Tyr 






355 










360 


Asn 


xxe 


(jXU 


V a.X 


Pro 


Asn 


Asn 


Gin 














375 




nit A 

trie 


Tyr 


XjCU 


JjcU 


Glu 


Pro 


Gly 


Val 


385 








390 






Tyr 


VdX 


IjXU 


T 1 *• 
XXC 


Asn 


Gly 


Glu 


Lys 








405 








VAX 




1 IIX. 




Asn 


Ser 


Asn 


Lys 








420 












oCX 


lyr 


nx 


Asp 


Thr 


Gly Phe 






*■ W 










440 


OCX 


OCX 






Cys 


Pro 


Gly Gin 




450 










455 




T 1 Q 

J. xe 


Axg 




vxXU 


Leu 


Arg 


Cys 


Asp 


465 








470 






Ser 


Asp 


Glu 


Leu 


Asn 


Cys 


Ser 


Cys 








485 








Lys 


Asn 


Lys 


Phe 


Cys 


Lys 


Pro 


Leu 








500 










Asp 


Cys 


Gly 


Asp 


Asn 


Ser 


Asp 


Glu 






515 










520 


Thr 


Phe 


Arg 


Cys 


Ser 


Asn 


Gly 


Lys 




530 








535 




Asn 


Gly 


Lys 


Asp 


Asp 


Cys 


Gly Asp 


545 










550 






Lys 


Val 


Asn 


Val 


Val 


Thr 


Cys 


Thr 








565 








Gly 


Leu 


Cys 


Leu 


Ser 


Lys 


Gly Asn 






580 










Cys 


Ser 


Asp 


Gly 


Ser 


Asp 


Glu 


Lys 




595 










600 


Phe 


Thr 


Arg 


Gin 


Ala 


Arg 


Val 


val 




610 










615 




Glu 


Trp 


Pro 


Trp 


Gin 


Val 


Ser 


Leu 


625 








630 






Cys 


Gly 


Ala 


Ser 


Leu 


He 


Ser 


Pro 



IBS 










190 






Thr 


Asp 


Ser 


Lys 


Thr 


Val 


Gin 


Arg 










205 








Gly 


Leu 


His 


Ala 


Arg 


Gly 


Val 


Glu 








220 










Phe 


Pro 


Asp 


Ser 


Pro' 


Tyr 


Pro 


Ala 






235 










240 


Arg Gly 


Asp 


Ala 


Asp 


Ser 


Val 


Leu 




250 










255 




Leu 


Ala 


Ser 


Cys 


Asp 


Glu 


Arg 


Gly 


265 










270 






Thr 


Leu 


Ser 


Pro 


Met 


Glu 


Pro 


His 










2Q5 








Tvr 


Pro 


Pro 


Ser 


Tyr 

* 


Asn 


Leu 


Thr 








300 










Leu 


He 


Thr 


Leu 


He 


Thr 


Asn 


Thr 






315 










320 


Ala 


Thr 


Phe 


Phe 


Gin 


Leu 


Pro 


Arg 




330 










335 




Arcr 


Lvs 


Ala 


Gin 


Gly 


Thr 


Phe 


Asn 


345 










350 






Pro 


Pro 


Asn 


lie 


Asp 


Cys 


Thr 


Trp 










365 








His 


Val 


Lvs 


Val 


Ser 


Phe 


Lys 


Phe 








380 










Pro 


Ala 


Gly 


Thr 


Cys 


Pro 


Lys 


Asp 






3 95 










400 


Ay t 




Gly 


Glu 


Arq 


Ser 


Gin 


Phe 




410 










415 




He 


Thr 


Val 


Arg 


Phe 


His 


Ser 


Asp 


425 










430 






Leu 


Ala 


Glu 


Tvr 


Leu 


Ser 


Tyr 


Asp 










445 








Phe 


Thr 


CV6 


Arg 


Thr 


Gly 


Arg 


Cys 








460 










Gly 


Trp 


Ala 


Aso 


Cvs 


Thr 


Asp 


His 






475 










480 


Asp 


Ala 


Gly 


His 


Gin 


Phe 


Thr 


Cys 




490 










495 




Phe 


Trp 


Val 


Cys 


Asp 


Ser 


Val 


Asn 


505 










510 






Gin 


Gly 


Cys 


Ser 


Cys 


Pro 


Ala 


Gin 










525 








Cys 


Leu 


Ser 


Lys 


Ser 


Gin 


Gin 


Cys 






540 










Gly 


Ser 


Asp 


Glu 


Ala 


Ser 


Cys 


Pro 




555 










560 


Lys 


His 


Thr 


Tyr 


Arg 


Cys 


Leu 


Asn 




570 










575 




Pro 


Glu 


Cys 


Asp 


Gly 


Lys 


Glu 


Asp 


585 










590 






Asp 


Cys 


Asp 


Cys 


Gly 


Leu 


Arg 


Ser 






605 








Gly 


Gly 


Thr 


Asp 


Ala 


Asp 


Glu 


Gly 






620 










His 


Ala 


Leu 


Gly 


Gin 


Gly 


His 


He 






635 










640 


Asn 


Trp 


Leu 


Val 


Ser 


Ala 


Ala 


His 
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645 



CVB 


Tvx 


He Asp Asp Arg Gly 


Phe 






660 






Ala 


Phe Leu Gly Leu His 


Asp 






675 


660 


Val 


Gin 


Glu Arg Arg Leu LyB 


Arg 




690 


695 




Asp 


Phe 


Thr Phe Asp Tyr Asp 


He 


705 




710 




Ala 


Glu 


Tyr Ser Ser Met Val 


Arg 






725 




His 


Val 


Phe Pro Ala Gly Lys 


Ala 






740 




Thr 


Gin 


Tyr Gly Gly Thr Gly Ala 






755 


760 


Arg 


val 


Xle Asn Gin Thr Thr 


Cys 


770 


775 




Thr 


Pro 


Arg Met Met Cys Val 


Gly 


785 




790 




Cys 


Gin 


Gly Asp Ser Gly Gly 


Pro 






805 




Arg 


He 


Phe Gin Ala Gly Val 


Val 






820 




Arg 


Asn 


Lys Pro Gly Val Tyr 


Thr 






B35 


840 


He 


Lys 


Glu Asn Thr Gly Val 






B50 


855 







650 










V 




Arg Tyr 


Ser 


Asp 


Pro 


Thr 




j.rp 


665 










670 






Gin 


Ser 


Gin 


Arg 


Ser 


Ala 


Pro 


Gly 










685 








He 


He 


Ser 


His 


Pro 


Phe 


Phe 


Asn 








700 










Ala 


Leu 


Leu 


Glu 


Leu 


Glu 


Lys 


Pro 






715 








72 0 


Pro 


He 


Cys 


Leu 


Pro 


Asp 


Ala 


Ser 




730 










735 




He 


Trp 


Val 


Thr Gly Trp 


Gly 


His 


745 










750 






Leu 


He 


Leu 


Gin 


Lys 


Gly 


Glu 


He 










765 








Glu 


Asn 


Leu 


Leu 


Pro 


Gin 


Gin 


He 








780 










Phe 


Leu 


Ser 


Gly 


Gly 


Val 


Asp 


Ser 






795 










800 


Leu 


Ser 


Ser 


Val 


Glu 


Ala 


Asp 


Gly 




810 










815 




Ser Trp Gly Asp 


Gly Cys 


Ala 


Gin 


825 










830 






Arg 


Leu 


Pro 


Leu 


Phe 


Arg 


Asp 


Trp 



845 



<210> 3 
<211> 3147 
<212> DNA 
<:213> Homo Sapien 



<220> 
<221> CDS 

<222> (1865) . . . (2590) 

<223> Nucleic acid sequence of protease domain of MTSPl 
<400> 3 

tcaagagcgg cctcggggta ccatggggag cgatcgggcc cgcaagggcg gagggggccc 60 

gaaggacttc ggcgcgggac tcaagtacaa ctcccggcac gagaaagtga atggcttgga 120 

ggaaggcgtg gagttcctgc cagtcaacaa cgtcaagaag gtggaaaagc atggcccggg 180 

gcgctgggtg gtgctggcag ccgtgctgat cggcctcctc ttggtcttgc tggggatcgg 240 

cttcctggtg tggcatttgc agtaccggga cgtgcgtgtc cagaaggtct tcaatggcta 3 00 

catgaggatc acaaatgaga attttgtgga tgcctacgag aactccaact ccactgagtt 360 

tgtaagcctg gccagcaagg tgaaggacgc gctgaagctg ctgtacagcg gagtcccatt 420 

cctgggcccc taccacaagg agtcggctgt gacggccttc agcgagggca gcgtcatcgc 480 

ctactactgg tctgagttca gcatcccgca gcacctggtg gaggaggccg agcgcgtcat 54 0 

ggccgaggag cgcgtagtca tgctgccccc gcgggcgcgc tccctgaagt cctttgtggt 600 

cacctcagtg gtggctttcc ccacggactc caaaacagta cagaggaccc aggacaacag 6 60 

ctgcagcttt ggcctgcacg cccgcggtgt ggagctgatg cgcttcacca cgcccggctt 720 

ccctgacagc ccctaccccg ctcatgcccg ctgccagtgg gccctgcggg gggacgccga 780 

ctcagtgctg agcctcacct tccgcagctt tgaccttgcg tcctgcgacg agcgcggcag 84 0 

cgacctggtg acggtgtaca acaccctgag ccccatggag ccccacgccc tggtgcagtt 900 

gtgtggcacc taccctccct cctacaacct gaccttccac tcctcccaga acgtcctgct 960 

catcacactg ataaccaaca ctgagcggcg gcatcccggc tttgaggcca ccttcttcca 102 0 

gctgcctagg atgagcagct gtggaggccg cttacgtaaa gcccagggga cattcaacag 1080 

cccctactac ccaggccact acccacccaa cattgactgc acatggaaca ttgaggtgcc 114 0 

caacaaccag catgtgaagg tgagcttcaa attcttctac ctgctggagc ccggcgtgcc 1200 
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tgcgggcacc tgccccaagg actacgtgga gatcaatggg gagaaatact gcggagagag 12 60 

gtcccagttc gtcgtcacca gcaacagcaa caagatcaca gttcgcttcc actcagatca 1320 

gtcctacacc gacaccggct tcttagctga atacctctcc tacgactcca gtgacccatg 13 80 

cccggggcag ttcacgtgcc gcacggggcg gtgtatccgg aaggagctge gctgtgatgg 1440 

ctgggccgac tgcaccgacc acagcgatga gctcaactgc agttgcgacg ccggccacca 1500 

gttcacgtgc aagaacaagt tctgcaagcc cctcttctgg gtctgcgaca gtgtgaacga 1560 

ctgcggagac aacagcgacg agcaggggtg cagttgtccg gcccagacct tcaggtgttc 1620 

caatgggaag tgcctctcga aaagccagca gtgcaatggg aaggacgact gtggggacgg 1680 

gtccgacgag gcctcctgcc ccaaggtgaa cgtcgtcact tgtaccaaac acacctaccg 1740 

ctgcctcaat gggctctgct tgagcaaggg caaccctgag tgtgacggga aggaggactg 1800 

tagcgacggc tcagatgaga aggactgcga ctgtgggctg cggtcattca cgagacaggc 1860 

tcgt gtt gtt ggg ggc acg gat gcg gat gag ggc gag tgg ccc tgg cag 1909 
Val Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin 
15 10 15 

gta age ctg cat get ctg ggc cag ggc cac ate tgc ggt get tec etc 1957 
Val Ser Leu His Ala Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu 

20 25 30 

ate tet ccc aac tgg ctg gtc tct gcc gca cac tgc tac ate gat gac 2005 
He Ser Pro Abu Trp Leu Val Ser Ala Ala His Cys Tyr He Asp Asp 
35 40 45 

aga gga ttc agg tac tea gac ccc acg cag tgg acg gcc ttc ctg ggc 2 053 

Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly 
50 55 60 

ttg cac gac cag age cag cgc age gcc cet ggg gtg cag gag cgc agg 2101 
Leu His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg 
65 70 75 

etc aag cgc ate ate tec cac ccc ttc ttc aat gac ttc acc ttc gac 2149 
Leu Lys Arg He He Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp 
BO 85 90 95 

tat gac ate gcg ctg ctg gag ctg gag aaa ccg gca gag tac age tec 2197 
Tyr Asp He Ala Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser 

100 105 110 

atg gtg egg ccc ate tgc ctg ccg gac gcc tec cat gtc ttc cet gee 2245 
Met Val Arg Pro He Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala 
115 120 125 

ggc aag gcc ate tgg gtc acg ggc tgg gga cac ace cag tat gga ggc 2293 
Gly Lys Ala He Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly 
130 135 140 

act ggc gcg ctg ate ctg caa aag ggt gag ate cgc gtc ate aac cag 2341 
Thr Gly Ala Leu He Leu Gin Lys Gly Glu He Arg Val He Asn Gin 
145 150 155 

acc acc tgc gag aac etc ctg ccg cag cag ate acg ccg cgc atg atg 2389 
Thr Thr Cys Glu Asn Leu Leu Pro Gin Gin He Thr Pro Arg Met Met 
160 165 170 175 

tgc gtg ggc ttc etc age ggc ggc gtg gac tec tgc cag ggt gat tec 2437 
Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser 

180 185 190 
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ggg gga ccc ctg tec age gtg gag gcg gat ggg egg ate ttc cag gee 24 85 

Gly Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg lie Phe Gin Ala 
195 200 205 

ggt gtg gtg age tgg gga gac ggc tgc get cag agg aac aag cca ggc 2533 
Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly 
210 215 220 

gtg tae aca agg cte cct ctg ttt egg gae tgg ate aaa gag aac act 2581 
Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp lie Lys Glu Asn Thr 
225 230 235 

ggg gta tag gggccggggc cacccaaatg tgtacacctg cggggceacc 2630 

Gly Val * 

240 

catcgtccac cccagtgtgc acgcctgcag getggagact ggaeegctga ctgcaccagc 2690 

geececagaa catacactgt gaactcaate tccagggctc caaatctgcc tagaaaacet 2750 

etcgetteet cagcctccaa agtggagctg ggaggtagaa ggggaggaca ctggtggttc 2 BIO 

tactgaccca actgggggca aaggtttgaa gacacagcct cccecgceag ecccaagctg 2B70 

ggccgaggcg cgtttgtgta tatctgcctc ccctgtctgt aaggagcagc gggaacggag 2 93 0 

cttcggagcc tcctcagtga aggtggtggg gctgceggat ctgggetgtg gggcecttgg 2990 

gccaegctct tgaggaagcc caggctcgga ggaecetgga aaacagaegg gtetgagaet 3 050 

gaaattgttt taccagctcc cagggtggac ttcagtgtgt gtatttgtgt aaatgggtaa 3110 

aaeaatttat ttcttttcaa aaaaaaaaaa aaaaaaa 314 7 

<210> 4 

<211> 241 

<212> PRT 

<213> Homo Sapien 

<400> 4 

Val Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin Val 

IS 10 15 

Ser Leu His Ala Leu Gly Gin Gly Hie lie Cys Gly Ala Ser Leu lie 

20 25 30 

Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr He Asp Asp Arg 

35 40 45 

Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu 

50 55 60 

His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu 
65 70 75 80 

Lys Arg He He Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr 

85 90 95 

Asp He Ala Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met 

100 105 110 

Val Arg Pro He Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly 

115 120 125 

Lys Ala He Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr 

130 135 140 

Gly Ala Leu He Leu Gin Lys Gly Glu He Arg Val He Asn Gin Thr 
145 150 155 160 

Thr Cys Glu Asn Leu Leu Pro Gin Gin He Thr Pro Arg Met Met Cys 

165 170 175 

Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly 

180 185 190 

Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg He Phe Gin Ala Gly 
195 200 205 
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Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val 

210 215 220 

Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp He Lys Glu Asn Thr Gly 
225 230 235 240 

Val 

<210> 5 

<211> 804 

<212> DNA 

<213> Homo Sapien 

<220> 

<:221> CDS 

<222> (1) . . . (804) 

<223> Nucleotide sequence encoding MTSP20, including 
MTSP20 protease domain 

<400> 5 

aca gca ggt ccc cag gea gga gca ccc tec cca tgg ccc tgg gag gcc 
Thr Ala Gly Pro Gin Ala Gly Ala Pro Ser Pro Trp Pro Trp Glu Ala 
15 10 15 

agg ctg atg cac cag gga cag ctg gcc tgt ggc gga gcc ctg gtg tea 
Arg Leu Met His Gin Gly Gin Leu Ala Cys Gly Gly Ala Leu Val Ser 
20 25 30 

gag gag acg gtg eta act gtt gcc cac tgc ttc att ggg cgc cag gcc 
Glu Glu Thr Val Leu Thr Val Ala His Cys Phe He Gly Arg Gin Ala 
35 40 45 



4B 



96 



144 



288 



cca gag gaa tgg age gta ggg ctg ggg acc aga ccg gag gag tgg ggc 192 
Pro Glu Glu Trp Ser Val Gly Leu Gly Thr Arg Pro Glu Glu Trp Gly 
50 55 60 

ctg aag cag etc ate ctg cat gga gcc tac acc cac cct gag ggg ggc 240 
Leu Lys Gin Leu He Leu His Gly Ala Tyr Thr His Pro Glu Gly Gly 
65 70 75 80 

tac gac atg gcc etc ctg ctg ctg gcc cag cct gtg aca ctg gga gcc 
Tyr Asp Met Ala Leu Leu Leu Leu Ala Gin Pro Val Thr Leu Gly Ala 

85 90 95 

age ctg egg ccc etc tgc ctg ccc tat cct gac cac cac ctg cct gat 3 36 

Ser Leu Arg Pro Leu Cys Leu Pro Tyr Pro Asp His His Leu Pro Asp 
100 105 110 

ggg gag cgt ggc tgg gtt ctg gga egg gcc cgc cca gga gca ggc ate 384 
Gly Glu Arg Gly Trp Val Leu Gly Arg Ala Arg Pro Gly Ala Gly He 
115 120 125 

age tec etc cag aca gtg ccc gtg acc etc ctg ggg cct agg gcc tgc 432 
Ser Ser Leu Gin Thr Val Pro Val Thr Leu Leu Gly Pro Arg Ala Cys 
130 135 140 

age egg ctg cat gca get cct ggg ggt gat ggc age cct att ctg ccg 480 
Ser Arg Leu His Ala Ala Pro Gly Gly Asp Gly Ser Pro He Leu Pro 
145 150 155 160 



ggg atg gtg tgt ace agt get gtg ggt gag ctg ccc age tgt gag ggc 



528 
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Gly Met Val Cys Thr Ser Ala Val Gly Glu Leu Pro Ser Cys Glu Gly 

165 170 175 

ctg tct ggg gca cca ctg gtg cat gag gtg agg ggc aca tgg ttc ctg 576 
Leu Ser Gly Ala Pro Leu Val His Glu Val Arg Gly Thr. Trp Phe Leu 
180 185 190 

gcc ggg ctg cac age ttc gga gat get tgc caa ggc ccc gcc agg ccg 624 
Ala Gly Leu His Ser Phe Gly Asp Ala Cys Gin Gly Pro Ala Arg Pro 
195 200 205 

gcg gtc ttc acc gcg etc cct gcc tat gag gac tgg gtc age agt ttg 672 
Ala Val Phe Thr Ala Leu Pro Ala Tyr Glu Asp Trp Val Ser Ser Leu 
210 215 220 

gac tgg cag gtc tac ttc gcc gag gaa cca gag ccc gag get gag cct 720 
Asp Trp Gin Val Tyr Phe Ala Glu Glu Pro Glu Pro Glu Ala Glu Pro 
225 230 235 240 

gga age tgc ctg gcc aac atg agt atg tgg ccc egg ggc etc ctg cca 768 
Gly Ser Cys Leu Ala Asn Met Ser Met Trp Pro Arg Gly Leu Leu Pro 

245 250 255 

aac cct gcc tct cca gga ccc ttc tct etc cag tga 804 
Asn Pro Ala Ser Pro Gly Pro Phe Ser Leu Gin * 
260 265 



<210> 6 

<211> 267 

<212> PRT 

<213> Homo Sapien 



<400> 6 


























Thr 


Ala 


Gly 


Pro 


Gin 


Ala 


Gly 


Ala 


Pro 


Ser 


Pro 


Trp 


Pro 


Trp Glu Ala 


1 








5 










10 








15 


Arg 


Leu 


Met 


HiB 


Gin 


Gly Gin 


Leu 


Ala 


Cys 


Gly Gly 


Ala 


Leu Val Ser 








20 










25 










30 


Glu 


Glu 


Thr 


Val 


Leu 


Thr 


Val 


Ala 


His 


Cys 


Phe 


He 


Gly Arg Gin Ala 






35 










40 










45 




Pro 


Glu 


Glu 


Trp 


Ser 


Val 


Gly 


Leu 


Gly Thr Arg 


Pro 


Glu 


Glu Trp Gly 




50 










55 










60 






Leu 


Lys 


Gin 


Leu 


He 


Leu 


His 


Gly Ala 


Tyr 


Thr 


His 


Pro 


Glu Gly Gly 


65 










70 










75 






80 


Tyr 


Asp 


Met 


Ala 


Leu 


Leu 


Leu 


Leu 


Ala 


Gin 


Pro 


Val 


Thr 


Leu Gly Ala 










85 










90 








95 


Ser 


Leu 


Arg 


Pro 


Leu 


Cys 


Leu 


Pro 


Tyr 


Pro 


Asp 


His 


His 


Leu Pro Asp 








100 










105 










110 


Gly 


Glu 


Arg 


Gly 


Trp 


Val 


Leu 


Gly 


Arg 


Ala 


Arg 


Pro 


Gly Ala Gly He 






115 










120 










125 




Ser 


Ser 


Leu 


Gin 


Thr 


Val 


Pro 


Val 


Thr 


Leu 


Leu 


Gly 


Pro 


Arg Ala Cys 




130 










135 










140 






Ser 


Arg 


Leu 


His 


Ala 


Ala 


Pro 


Gly Gly Asp 


Gly Ser 


Pro 


He Leu Pro 


145 










150 










155 






160 


Gly 


Met 


Val 


Cys 


Thr 


Ser 


Ala 


Val 


Gly 


Glu 


Leu 


Pro 


Ser 


Cys Glu Gly 










165 










170 








175 


Leu 


Ser 


Gly 


Ala 


Pro 


Leu 


Val 


His 


Glu 


Val 


Arg 


Gly 


Thr 


Trp Phe Leu 








IBO 










185 










190 


Ala 


Gly 


Leu 


His 


Ser 


Phe 


Gly 


Asp 


Ala 


Cys 


Gin Gly 


Pro 


Ala Arg Pro 
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195 










200 


Ala 


Val 
210 


Phe 


Thr 


Ala 


Leu 


Pro 
215 


Ala 


Asp 


Trp 


Gin 


Val 


Tyr 


Phe 


Ala 


Glu 


225 










230 






Gly 


Ser 


Cys 


Leu 


Ala 
245 


Asn 


Met 


Ser 


Asn 


Pro 


Ala 


Ser 


Pro 


Gly 


Pro 


Phe 



260 



<210> 7 
<211> 30 
<212> DNA 

<:213> Artificial Sequence 
<220> 

<223> 5' end primer 
<:400> 7 

gaagcagctc atcctgcatg gagcctacac 



205 

Tyr Glu Asp Trp Val Ser Ser Leu 
220 

Glu Pro Glu Pro Glu Ala Glu Pro 
235 240 
Met Trp Pro Arg Gly Leu Leu Pro 

250 255 
Ser Leu Gin 
265 



<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 3' end primer 
<400> 8 

ggccaggaac catgtgcccc tcacctc 27 

«:210> 9 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR 5' end primer 
<400> 9 

gaagcagctc atcctgcatg gagcctacac 29 

<210> 10 
<211> 27 
<212> DNA 

«:213> Artificial Sequence 
<220> 

<223> PCR 3' end primer 
<400> 10 

ggccaggaac catgtgcccc tcacctc 27 

c210> 11 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR 5' end primer 
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<400> 11 

gaggcactta tctgtttcag cctgtggatc 30 

<210> 12 
«:211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR 3' end primer 
<400> 12 

gtgggttaga acggcacgtt ctcactggag 30 

<210> 13 
<211> 9276 
<212> DNA 

<213> Pichia pastoris 



<400> 13 

agate taaca tccaaagacg aaaggttgaa tgaaaccttt ttgccatccg acatccacag 60 

gtccattctc acacataagt gccaaacgca acaggagggg atacactagc agcagaccgt 12 0 

tgcaaacgca ggacctccac tcctcttctc ctcaacaccc acttttgcca tcgaaaaacc 180 

agcccagtta ttgggcttga ttggagctcg ctcattccaa ttccttctat taggctacta 240 

acaccatgac tttattagcc tgtctatcct ggcccccctg gcgaggttca tgtttgttta 300 

tttccgaatg caacaagctc cgcattacac ccgaacatca ctccagatga gggctttctg 360 

agtgtggggt caaatagttt catgttcccc aaatggccca aaactgacag tttaaacgct 420 

gtcttggaac ctaatacgac aaaagcgtga tctcatccaa gatgaactaa gtttggttcg 480 

ttgaaatgct aacggccagt tggtcaaaaa gaaacttcca aaagtcgcca taccgtttgt S40 

cttgtttggt attgattgac gaatgctcaa aaataatctc attaatgctt agcgcagtct 600 

ctctatcgct tctgaacccc ggtgcacctg tgccgaaacg caaatgggga aacacccgct 660 

ttttggatga ttatgcattg tctccacatt gtatgcttcc aagattctgg tgggaatact 72 0 

gctgatagcc taacgttcat gatcaaaatt taactgttct aacccctact tgacagcaat 780 

atataaacag aaggaagctg ccctgtctta aacctttttt tttatcatca ttattagctt 84 0 

actttcataa ttgcgactgg ttccaattga caagcttttg attttaacga cttttaacga 90 0 

caacttgaga agatcaaaaa acaactaatt attcgaagga tccaaacgat gagatttcct 960 

tcaattttta ctgcagtttt attcgcagca tcctccgcat tagctgctcc agtcaacact 102 0 

acaacagaag atgaaacggc acaaattccg gctgaagctg tcatcggtta ctcagattta 10 80 

gaaggggatt tcgatgttgc tgttttgcca ttttccaaca gcacaaataa cgggttattg 114 0 

tttataaata ctactattgc cagcattgct gctaaagaag aaggggtatc tctcgagaaa 12 00 

agagaggctg aagcttacgt agaattccct agggcggccg cgaattaatt cgccttagac 12 60 

atgactgttc ctcagttcaa gttgggcact tacgagaaga ccggtcttgc tagattctaa 1320 

tcaagaggat gtcagaatgc catttgcctg agagatgcag gcttcatctt tgatactttt 13 80 

ttatttgtaa cctatatagt ataggatttt ttttgtcatt ttgtttcttc tcgtacgagc 144 0 

ttgctcctga tcagcctatc tcgcagctga tgaatatctt gtggtagggg tttgggaaaa 1500 

tcattcgagt ttgatgtttt tcttggtatt tcccactcct cttcagagta cagaagatta 1560 

agtgagaagt tcgtttgtgc aagcttatcg ataagcttta atgcggtagt ttatcacagt 162 0 

taaattgcta acgcagtcag gcaccgtgta tgaaatctaa caatgcgctc atcgtcatcc 1680 

tcggcaccgt caccctggat gctgtaggca taggcttggt tatgccggta ctgccgggcc 1740 

tcttgcggga tatcgtccat tccgacagca tcgccagtca ctatggcgtg ctgctagcgc 1800 

tatatgcgtt gatgcaattt ctatgcgcac ccgttctcgg agcactgtcc gaccgctttg 1860 

gccgccgccc agtcctgctc gcttcgctac ttggagccac tatcgactac gcgatcatgg 1920 

cgaccacacc cgtcctgtgg atctatcgaa tctaaatgta agttaaaatc tctaaataat 1980 

taaataagtc ccagtttctc catacgaacc ttaacagcat tgcggtgagc atctagacct 204 0 

tcaacagcag ccagatccat cactgcttgg ccaatatgtt tcagtccctc aggagttacg 210 0 

tcttgtgaag tgatgaactt ctggaaggtt gcagtgttaa ctccgctgta ttgacgggca 216 0 

tatccgtacg ttggcaaagt gtggttggta ccggaggagt aatctccaca actctctgga 2220 

gagtaggcac caacaaacac agatccagcg tgttgtactt gatcaacata agaagaagca 2280 

ttctcgattt gcaggatcaa gtgttcagga gcgtactgat tggacatttc caaagcctgc 2340 
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tcgtaggctg caaccgatag ggttgtagag tgtgcaatac acttgcgtac aatttcaacc 24 00 

cttggcaact gcacagcttg gttgtgaaca gcatcttcaa ttctggcaag ctccttgtct 24 60 

gtcatatcga cagccaacag aatcacctgg gaatcaatac catgttcagc ttgagacaga 2S20 

aggtctgagg caacgaaatc tggatcagcg tatttatcag caataactag aacttcagaa 2S80 

ggcccagcag gcatgtcaat actacacagg gctgatgtgt cattttgaac catcatcttg 2640 

gcagcagtaa cgaactggtt tcctggacca aatattttgt cacacttagg aacagtttct 2700 

gttccgtaag ccatagcagc tactgcctgg gcgcctcctg ctagcacgat acacttagca 2760 

ccaaccttgt gggcaacgta gatgacttct ggggtaaggg taccatcctt cttaggtgga 2820 

gatgcaaaaa caattccttt gcaaccagca actttggcag gaacacccag catcagggaa 2880 

gtggaaggca gaattgcggt tccaccagga atatagaggc caactttctc aataggtctt 294 0 

gcaaaacgag agcagactac accagggcaa gtctcaactt gcaacgtctc cgttagttga 30 00 

gcttcatgga atttcctgac gttatctata gagagatcaa tggctctctt aacgttatct 306 0 

ggcaattgca taagttcctc tgggaaagga gcttctaaca caggtgtctt caaagcgact 3120 

ccatcaaact tggcagttag ttctaaaagg gctttgtcac cattttgacg aacattgtcg 3180 

acaattggtt tgactaattc cataatctgt tccgttttct ggataggacg acgaagggca 3240 

tcttcaattt cttgtgagga ggccttagaa acgtcaattt tgcacaattc aatacgacct 3300 

tcagaaggga cttctttagg tttggattct tctttaggtt gttccttggt gtatcctggc 3360 

ttggcatctc ctttccttct agtgaccttt agggacttca tatccaggtt tctctccacc 3420 

tcgtccaacg tcacaccgta cttggcacat ctaactaatg caaaataaaa taagtcagca 34 80 

cattcccagg ctatatcttc cttggattta gcttctgcaa gttcatcagc ttcctcccta 354 0 

attttagcgc tcaacaaaac ttcgtcgtca aataaccgtt tggtataaga accttctgga 3 60 0 

gcattgctct tacgatccca caaggtggct tccatggctc taagaccctt tgattggcca 3660 

aaacaggaag tgcgttccaa gtgacagaaa ccaacacctg tttgttcaac cacaaatttc 372 0 

aagcagtctc catcacaatc caattcgata cccagcaact tttgagttgc tccagatgta 3760 

gcacctttat accacaaacc gtgacgacga gattggtaga ctccagtttg tgtccttata 384 0 

gcctccggaa tagacttttt ggacgagtac accaggccca acgagtaatt agaagagtca 3 900 

gccaccaaag tagtgaatag accatcgggg cggtcagtag tcaaagacgc caacaaaatt 3 960 

tcactgacag ggaacttttt gacatcttca gaaagttcgt attcagtagt caattgccga 4 020 

gcatcaataa tggggactat accagaagca acagtggaag tcacatctac caactttgcg 4 080 

gtctcagaaa aagcataaac agttctacta ccgccattag tgaaactttt caaatcgccc 414 0 

agtggagaag aaaaaggcac agcgatacta gcattagcgg gcaaggatgc aactttatca 42 00 

accagggtcc tatagataac cctagcgcct gggatcatcc tttggacaac tctttctgcc 4260 

aaatctaggt ccaaaatcac ttcattgata ccattattgt acaacttgag caagttgtcg 4 320 

atcagctcct caaattggtc ctctgtaacg gatgactcaa cttgcacatt aacttgaagc 4380 

tcagtcgatt gagtgaactt gatcaggtcg tgcagctggt cagcagcata gggaaacacg 444 0 

gcttttccta ccaaactcaa ggaattatca aactctgcaa cacttgcgta tgcaggtagc 4500 

aagggaaatg tcatacttga agtcggacag tgagtgtagt cttgagaaat tctgaagccg 4560 

tatttttatt atcagtgagt cagtcatcag gagatcctct acgccggacg catcgtggcc 462 0 

gacctgcagg ggggggggg9 gcgctgaggt ctgcctcgtg aagaaggtgt tgctgactca 4680 

taccaggcct gaatcgcccc atcatccagc cagaaagtiga gggagccacg gttgatgaga 474 0 

gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac ggaacggtct 4800 

gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg atttattcaa 4860 

caaagccgcc gtcccgtcaa gtcagcgcaa tgctctgcca gtgttacaac caattaacca 4 920 

attctgatta gaaaaactca tcgagcatca aatgaaactg caatttattc atatcaggat 4 980 

tatcaacacc atatttttga aaaagccgtt tctgtaatga aggagaaaac tcaccgaggc 504 0 

agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt ccaacatcaa 5100 

tacaacccat taatttcccc tcgtcaaaaa taaggttatc aagtgagaaa tcaccatgag 5160 

tgacgactga atccggtgag aatggcaaaa gcttatgcat ttctttccag acttgttcaa 5220 

caggccagcc attacgctcg tcatcaaaat cactcgcatc aaccaaaccg ttattcattc 52 80 

gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa ttacaaacag 5340 

gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt tcacctgaat 5400 

caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagtg gtgagtaacc 5460 

atgcatcatc aggagtacgg ataaaatgct tgatggtcgg aagaggcata aattccgtca 5520 

gccagtttag tctgaccatc tcatctgtaa catcattggc aacgctacct ttgccatgtt 5580 

tcagaaacaa ctctggcgca tcgggcttcc catacaatcg atagattgtc gcacctgatt 5640 

gcccgacatt atcgcgagcc catttatacc catataaatc agcacccatg ttggaattta 57 0 0 

atcgcggcct cgagcaagac gtttcccgtt gaatatggct cataacaccc cttgtattac 5760 

tgtttatgta agcagacagc ttcattgttc atgatgatat atttttatct tgtgcaatgt 5820 

aacatcagag attttgagac acaacgtggc tttccccccc ccccctgcag gtcggcatca 5880 
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ccggcgccac aggtgcggtt gctggcgcct atatcgccga catcaccgat ggggaagatc 5 940 

gggctcgcca cttcgggctc atgagcgctt gtttcggcgt gggtatggtg gcaggccccg 6000 

tggccggggg actgttgggc gccatctcct tgcatgcacc attccttgcg gcggcggtgc 6060 

tcaacggcct caacctacta ctgggctgct tcctaatgca ggagtcgcat aagggagagc 612 0 

gtcgagtatc tatgattgga agtatgggaa tggtgatacc cgcattcttc agtgtcttga 6180 

gcftctcctat cagattatgc ccaactaaag caaccggagg aggagatttc atggtaaatt 624 0 

tctctgactt ttggtcatca gtagactcga actgtgagac tatctcggtt atgacagcag 6300 

aaatgtcctt cttggagaca gtaaatgaag tcccaccaat aaagaaatcc ttgttatcag 6360 

gaacaaactt cttgtttcga actttttcgg tgccttgaac tataaaatgt agagtggata 6420 

tgtcgggtag gaatggagcg ggcaaatgct taccttctgg accttcaaga ggtatgtagg 6460 

gtttgtagat actgatgcca acttcagtga caacgttgct atttcgttca aaccattccg 6540 

aatccagaga aatcaaagtt gtttgtctac tattgatcca agccagtgcg gtcttgaaac 6600 

tgacaatagt gtgctcgtgt tttgaggtca tctttgtatg aataaatcta gtctttgatc 6660 

taaataatct tgacgagcca aggcgataaa tacccaaatc taaaacCctt ttaaaacgtt 6720 

aaaaggacaa gtatgtctgc ctgtattaaa ccccaaatca gctcgtagtc tgatcctcat 6780 

caacttgagg ggcactatct tgttttagag aaatttgcgg agatgcgata tcgagaaaaa 6840 

ggtacgctga ttttaaacgt gaaatttatc tcaagatctc tgcctcgcgc gtttcggtga 6900 

tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagcct gtctgtaagc 6960 

ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 7020 

cgcagccatg acccagtcac gtagcgatag cggagtgtat actggcttaa ctatgcggca 7080 

tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 714 0 

aggagaaaat accgcatcag gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 7200 

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 7260 

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 7320 

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 7380 

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 744 0 

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 7500 

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 7560 

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 7620 

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 7680 

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 774 0 

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 7800 

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 7860 

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 7920 

aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 7 98 0 

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 804 0 

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 8100 

gacagctacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 8160 

tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 822 0 

ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 8280 

ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 8340 

atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 8400 

cgcaacgttg ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct 8460 

tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 852 0 

aaagcggtta gccccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 858 0 

tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 864 0 

ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 870 0 

agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa 876 0 

gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 882 0 

agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 8 880 

accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 8 94 0 

gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 900 0 

cagggctatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 9060 

ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 9120 

atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaattaatt 9180 

ctcatgtttg acagcttatc atcgataagc tgactcatgt tggtattgtg aaatagacgc 9240 

agatcgggaa cactgaaaaa taacagttat tattcg 9276 
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<210> 14 
<211> 11 
<212> PRT 

<213> Pichia protease cleavage site 
<400> 14 

Lys Arg lie Ala Ser Gly Val lie Ala Pro Lys 
15 10 



<210> 15 

<211> 1973 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> {24} . . . (1952) 

<223> Nucleotide sequence encoding full-length MTSP20, including 
MTSP20 protease domains 

<400> 15 

gcctcgggcg gaacctggag at a atg ggc age acc tgg ggg age cet ggc tgg 53 

Met Gly Ser Thr Trp Gly Ser Pro Gly Trp 
15 10 

gtg egg etc get ctt tgc ctg aeg ggc tta gtg etc teg etc tae geg 101 
Val Arg Leu Ala Leu Cys Leu Thr Gly Leu Val Leu Ser Leu Tyr Ala 

15 20 25 

ctg eac gtg aag geg gcg cgc gee egg gae egg gat tac cgc gcg etc 14 9 
Leu His Val Lys Ala Ala Arg Ala Arg Asp Arg Asp Tyr Arg Ala Leu 
30 35 40 

tgc gac gtg ggc acc gcc ate age tgt teg cgc gtc ttc tec tec agg 197 
Cys Asp Val Gly Thr Ala lie Ser Cys Ser Arg Val Phe Ser Ser Arg 
45 50 55 

tgg ggc agg ggt ttc ggg ctg gtg gag eat gtg ctg gga cag gac age 245 
Trp Gly Arg Gly Phe Gly Leu Val Glu His Val Leu Gly Gin Asp Ser 
60 65 70 

ate etc aat caa tec aac age ata ttc ggt tgc ate ttc tac aca eta 293 
lie Leu Asn Gin Ser Asn Ser He Phe Gly Cys He Phe Tyr Thr Leu 
75 80 B5 90 

cag eta ttg tta ggt. ctt caa gcc get cag cgt gcc tgt gga cag cgt 341 
Gin Leu Leu Leu Gly Leu Gin Ala Ala Gin Arg Ala Cys Gly Gin Arg 

95 100 105 

ggc ccc ggc ecc cec aag cet cag gag ggc aac aca gtc cet ggc gag 389 
Gly Pro Gly Pro Pro Lys Pro Gin Glu Gly Asn Thr Val Pro Gly Glu 
110 115 120 

tgg ccc tgg cag gcc agt gtg agg agg caa gga gcc cac ate tgc age 43 7 
Trp Pro Trp Gin Ala Ser Val Arg Arg Gin Gly Ala His He Cys Ser 
125 130 135 

ggc tec ctg gtg gca gac acc tgg gtc etc act get gcc cac tgc ttt 4 85 
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Gly Ser Leu Val Ala Asp Thr Trp Val Leu Thr Ala Ala His Cys Phe 
140 145 ISO 

gaa aag gca gca gca aca gaa ctg aat tec tgg tea gtg gtc ctg ggt 533 
Glu Lys Ala Ala Ala Thr Glu Leu Asn Ser Trp Ser Val Val Leu Gly 
155 160 165 170 

tct ctg cag cgt gag gga etc age ect ggg gcc gaa gag gtg ggg gtg 581 
Ser Leu Gin Arg Glu Gly Leu Ser Pro Gly Ala Glu Glu Val Gly Val 

175 180 1B5 

get gcc ctg cag ttg ccc agg gcc tat aac cac tac age cag ggc tea 629 
Ala Ala Leu Gin Leu Pro Arg Ala Tyr Asn His Tyr Ser Gin Gly Ser 
190 195 200 

gac ctg gcc ctg ctg cag etc gcc cac cec acg acc cac aca ccc etc 677 
Asp Leu Ala Leu Leu Gin Leu Ala His Pro Thr Thr His Thr Pro Leu 
205 210 215 

tgc ctg ccc cag ccc gee cat cgc ttc ccc ttt gga gcc tec tgc tgg 725 
Cys Leu Pro Gin Pro Ala His Arg Phe Pro Phe Gly Ala Ser Cys Trp 
220 225 230 



gcc act ggc tgg gat cag gac acc agt gat get cct ggg acc eta cgc 
Ala Thr Gly Trp Asp Gin Asp Thr Ser Asp Ala Pro Gly Thr Leu Arg 
235 240 245 250 



773 



aat ctg cgc ctg cgt etc ate agt cgc ccc aca tgt aac tgt ate tac 821 
Asn Leu Arg Leu Arg Leu lie Ser Arg Pro Thr Cys Asn Cys lie Tyr 

255 260 265 

aac cag ctg cac cag cga cac ctg tec aac ccg gcc egg cct ggg atg 8S9 
Asn Gin Leu His Gin Arg His Leu Ser Asn Pro Ala Arg Pro Gly Met 
270 275 280 

eta tgt ggg ggc ccc cag cct ggg gtg cag ggc ccc tgt cag gga gat 917 
Leu Cys Gly Gly Pro Gin Pro Gly Val Gin Gly Pro Cys Gin Gly Asp 
285 290 295 

tee ggg ggc cct gtg ctg tgc etc gag cct gac gga cac tgg gtt cag 965 
Ser Gly Gly Pro Val Leu Cys Leu Glu Pro Asp Gly His Trp Val Gin 
300 305 310 

get ggc ate ate age ttt gca tea age tgt gcc cag gag gac get cct 1013 
Ala Gly lie He Ser Phe Ala Ser Ser Cys Ala Gin Glu Asp Ala Pro 
315 320 325 330 

gtg ctg ctg acc aac aca get get cac agt tec tgg ctg cag get cga 1061 
Val Leu Leu Thr Asn Thr Ala Ala His Ser Ser Trp Leu Gin Ala Arg 

335 340 345 

gtt cag ggg gca get ttc ctg gcc cag age eca gag acc ccg gag atg 110 9 
Val Gin Gly Ala Ala Phe Leu Ala Gin Ser Pro Glu Thr Pro Glu Met 
350 355 360 

agt gat gag gac age tgt gta gcc tgt gga tec ttg agg aca gca ggt 1157 
Ser Asp Glu Asp Ser Cys Val Ala Cys Gly Ser Leu Arg Thr Ala Gly 
365 370 375 
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ccc cag gca gga gca ccc tec cca tgg ccc tgg gag gcc agg ctg atg 1205 
Pro Gin Ala Gly Ala Pro Ser Pro Trp Pro Trp Glu Ala Arg Leu Met 
380 385 390 

cac cag gga cag ctg gcc tgt ggc gga gcc ctg gtg tea gag gag gcg 1253 
His Gin Gly Gin Leu Ala Cys Gly Gly Ala Leu Val Ser Glu Glu Ala 
395 400 405 410 



gtg eta act get gcc cac tgc ttc att ggg cgc cag gcc cca gag gaa 
Val Leu Thr Ala Ala His Cys Phe lie Gly Arg Gin Ala Pro Glu Glu 

415 420 425 



etc ate ctg cat gga gcc tac acc cac cct gag ggg ggc tac gac atg 
Leu lie Leu His Gly Ala Tyr Thr His Pro Glu Gly Gly Tyr Asp Met 
445 450 455 

gcc etc ctg ctg ctg gcc cag cct gtg aca ctg gga gcc age ctg egg 
Ala Leu Leu Leu Leu Ala Gin Pro Val Thr Leu Gly Ala Ser Leu Arg 
460 4S5 470 



1301 



tgg age gta ggg ctg ggg acc aga ccg gag gag tgg ggc ctg aag cag 1349 
Trp Ser Val Gly Leu Gly Thr Arg Pro Glu Glu Trp Gly Leu Lys Gin 
430 435 440 



1397 



1445 



ccc etc tgc ctg ccc tat cct gac cac cac ctg cct gat ggg gag cgt 14 93 
Pro Leu Cys Leu Pro Tyr Pro Asp His His Leu Pro Asp Gly Glu Arg 
475 480 485 490 

ggc tgg gtt ctg gga egg gcc cgc cca gga gca ggc ate age tee etc 1541 
Gly Trp Val Leu Gly Arg Ala Arg Pro Gly Ala Gly lie Ser Ser Leu 

495 500 505 

cag aca gtg ccc gtg acc etc ctg ggg cct agg gcc tgc age egg ctg 1589 
Gin Thr Val Pro Val Thr Leu Leu Gly Pro Arg Ala Cys Ser Arg Leu 
510 515 520 

eat gca get cct ggg ggt gat ggc age cct att ctg ccg ggg atg gtg 1637 
His Ala Ala Pro Gly Gly Asp Gly Ser Pro He Leu Pro Gly Met Val 
525 530 535 

tgt acc agt get gtg ggt gag ctg ccc age tgt gag ggc ctg tct ggg 1685 
Cys Thr Ser Ala Val Gly Glu Leu Pro Ser Cys Glu Gly Leu Ser Gly 
540 545 550 

gca cca ctg gtg cat gag gtg agg ggc aca tgg ttc ctg gcc ggg ctg 1733 
Ala Pro Leu Val His Glu Val Arg Gly Thr Trp Phe Leu Ala Gly Leu 
555 560 565 570 

cac age ttc gga gat get tgc caa ggc ccc gcc agg ccg gcg gtc ttc 1781 
His Ser Phe Gly Asp Ala Cys Gin Gly Pro Ala Arg Pro Ala Val Phe 

575 580 585 

acc gcg etc cct gee tat gag gac tgg gtc age agt ttg gac tgg cag 1829 
Thr Ala Leu Pro Ala Tyr Glu Asp Trp Val Ser Ser Leu Asp Trp Gin 
590 595 600 

gtc tac ttc gee gag gaa cca gag ccc gag get gag cct gga age tgc 1877 
Val Tyr Phe Ala Glu Glu Pro Glu Pro Glu Ala Glu Pro Gly Ser Cys 
605 610 615 



t 
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ctg gcc aac ata agt atg tgg ccc egg ggc etc ctg eca aac cct gcc 1925 
Leu Ala Asn lie Ser Met Trp Pro Arg Gly Leu Leu Pro Asn Pro Ala 
620 625 630 



tct cca gga ccc tte tct etc cag tga gaacgtgccg ttctaaccca 1972 
Ser Pro Gly Pro Phe Ser Leu Gin * 
635 640 

1973 



<210> 16 
<211> 642 
<212> PRT 

<213> Homo sapiens 



<400> 16 
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Gly 
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Gly 
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Gly 
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Pro 


Gly 


Glu 


Trp 


Pro 


Trp 
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Ala 


Ser 






115 








120 










125 








Val 


Arg 
130 


Arg 


Gin 


Gly 
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His 
135 
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Ser 
140 
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Glu 
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Ala 
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Ala 


Thr 


145 
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Glu 
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Trp 
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Ser 


Val 
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Leu 


Gly 
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Ser 
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Glu 
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Gly 
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Val 


Gly Val 
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180 
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Tyr 
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Tyr 


Ser 
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200 


Gly 


Ser 


Asp 


Leu 


Ala 
205 


Leu 


Leu 
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Leu 


Ala 
210 


His 


Pro 


Thr 


Thr 


His 
215 


Thr 


Pro 


Leu 


Cys 


Leu 
220 


Pro 


Gin 


Pro 


Ala 


His 


Arg 


Phe 


Pro 


Phe 


Gly Ala 


Ser 


Cys 


Trp 
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Thr 


Gly 


Trp 
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225 
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235 
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Thr 


Ser 
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Gly 
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Arg 


Asn 


Leu 


Arg 
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Arg 
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245 










250 
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Ser 
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He 
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His 
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Arg 






260 
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Leu 


Ser 
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280 


Gly 


Met 


Leu 


Cys 
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Gly 
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Cys 
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Gly Asp 


Ser Gly Gly 


Pro 


Val 
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290 










295 
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Glu 
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Gly His 


Trp 
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Gly 


He 


He 
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Ser 
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Thr 
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325 








330 










335 
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^ c c 
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Trp 
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o o c 
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Pro 
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Trp 
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He 


Leu 


Pro 


Gly 




530 
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Glu 


Leu 
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Ser 
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Glu 


Gly 


Leu 
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550 






Val 


Arg 


Gly 


Thr 


Trp 


Phe 


Leu 
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565 








uys 


Gin 


Gly 


Pro 


Ala 


Arg 


Pro 


Ala 








580 










Glu 


Asp 


Trp 


Val 


Ser 


Ser 


Leu Asp 






595 










600 


Pro 


Glu 


Pro 


Glu 


Ala 


Glu 


Pro 


Gly 




610 










615 




Trp 


Pro 


Arg 


Qly 


Leu 


Leu 


Pro 


Asn 


625 
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Leu Gin 



Ala 


Arg 


Val 


Gin 


Gly Ala 


Ala 


Pne 


345 










350 






Glu 


Met 


Ser 


Asp 


Glu 


Asp 


Ser 


Cys 










365 








Ala Gly 


Pro 


Gin 


Ala 


Gly Ala 
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Leu 


Met 
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Gin Gly Gin 


Leu 


t\±.Ci 






395 










^\J\J 


Glu 


Ala 


Val 


Leu 


Thr 
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Ala 


nxS 




410 










415 




Glu 


Glu 


Trp 


Ser Val Gly Leu 


wi.y 


425 
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Lys 


Gin 


Leu 


He 
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Gly 


TV "1 
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Ala 


Leu 
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Leu 


Arg 


Pro 


Leu 


Cys 


Leu 
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Glu Arg 


Gly Trp Val 


Leu 


Gly 


Arg 




490 
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Leu 
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Thr 


Val 
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Thr 
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Arg 


Leu 
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540 










Ser 


Gly 
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Leu 


Val 
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560 


Gly 
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575 
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Leu 
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Trp 
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Glu 


Glu 
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Ser 
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He 


Ser 
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Pro 
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Ser 
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Gly 


Pro 


Phe 


Ser 
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<210> 17 
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<212> PRT 
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<310> WO 02/000860 
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Leu 
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Cys 


Phe 


He 


Gly 


Arg 


Gin Ala 


Pro 


Glu 


Glu 


Trp 


Ser 


Val 


Gly 


Leu 


Gly 








420 








425 










430 






Thr 


Arg 


Pro 


Glu 


Glu Trp Gly 


Leu 


Lys 


Gin 


Leu 


He 


Leu 


His 


Gly Ala 






435 








440 










445 








Tyr 


Thr 


His 


Pro 


Glu 


Gly Gly Tyr Asp 


Met 


Ala 


Leu 


Leu 


Leu 


Leu 


Ala 




450 








455 










460 










Gin 


Pro 


Val 


Thr 


Leu 


Gly Ala 


Ser 


Leu 


Arg 


Pro 


Leu 


Cys 


Leu 


Pro 


Tyr 


465 










470 








475 










480 


Pro 


Asp 


His 


His 


Leu 


Pro Asp 


Gly 


Glu 


Arg 


Gly 


Trp 


Val 


Leu 


Gly Arg 










485 








490 










495 




Ala 


Arg 


Pro 


Gly 


Ala Gly lie 


Ser 


Ser 


Leu 


Gin 


Thr 


Val 


Pro 


Val 


Thr 






500 








SOS 










510 






Leu 


Leu 


Gly 


Pro 


Arg 


Ala Cys 


Ser 


Arg 


Leu 


His 


Ala 


Ala 


Pro 


Gly Gly 






515 








520 










525 








Asp 


Gly 


Ser 


Pro 


He 


Leu Pro 


Gly 


Met 


Val 


Cys 


Thr 


Ser 


Ala 


Val 


Gly 
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530 










535 








540 










Glu 


Leu 


P*ro 


Ser 


Cys 


Glu Gly Leu 


Ser 


Gly Ala 


Pro 


Leu 


Val 


His 


Glu 


545 








550 








555 










560 


val 


Arg 


Gly 


Thr 


Trp 


Phe 


Leu Ala 


Gly 


Leu 


His 


Ser 


Phe 


Gly Asp Ala 










565 








570 










575 




Cys 


Gin Gly 


Pro 


Ala 


Arg 


Pro Ala 


Val 


Phe 


Thr 


Ala 


Leu 


Pro 


Ala 


Tyr 






580 








585 










590 






Glu Asp 


Trp 


Val 


Ser 


Ser 


Leu Asp 


Trp 


Gin 


Val 


Tyr 


Phe 


Ala 


Glu 


Glu 






595 








600 










605 








Pro 


Glu 


Pro 


Glu 


Ala 


Glu 


Pro Gly Ser 


Cys 


Leu 


Ala 


Asn 


He 


Ser 


Gin 




610 










615 








620 










Pro 


Thr 


Ser 


Cya 

























625 

<210> 18 
c211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5' end primer 
<400> IB 

gctgcttcag gccccactcc tccggtctgg 30 

<210:> 19 
<211> 30 
<212> DNA 

<:213> Artificial Sequence 
<220> 

<223> nested 5' end primer 
<400> 19 

cacaggccag ctgtccctgg tgcatcagcc 

<210> 20 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<:223> 5' end primer 
<400> 20 

agcacggccg gccaatcgcc gagtcagagg 30 

<210> 21 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> sense 
<:400> 21 

gcctcgggcg gaacctggag ataatgggc 29 

<210> 22 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> antlBense primer 
<400> 22 

gaccaggaat tcagttctgt tgctgctgcc 30 



<210> 23 
«:211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<222> sense primer 
<400> 23 

aagcctcagg agggcaacac agtccctg 28 

<2I0> 24 
<211> 28 
<:212> DNA 

<213> Artificial Sequence 
<220> 

<223> ant i sense primer 
<400> 24 

acagctgggc agctcaccca cagcactgg 2 8 



